Problem in nltkvid18.py (positive.txt and negative.txt show an unknown 8-bit charset) #3

@pratimaupadhyay02

Sir
When running nltkvid18.py, the following error is encountered:
UnicodeDecodeError: 'ascii' codec can't decode byte 0xed in position 6: ordinal not in range(128)
I am attaching the screenshot here:
[screenshot: issue1]
I found that positive.txt and negative.txt report an unknown charset type instead of ASCII or UTF-8, so the script was not able to read them.
As a workaround, I ran "cat -v negative.txt > negative1.txt" and "cat -v positive.txt > positive1.txt" in order to write ASCII-only copies of the files, named negative1.txt and positive1.txt. After switching to these files, the script worked.
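
An alternative that leaves the original files untouched is to decode them explicitly when reading. Note that cat -v rewrites every non-ASCII byte in a visible notation (for example "M-m" for 0xed), so the converted copies no longer contain the original characters. The sketch below is not taken from nltkvid18.py: the helper name read_text and the fallback list are hypothetical, and the guess that the files are Latin-1 is an assumption based only on the 0xed byte in the error message.

```python
import io


def read_text(path, encodings=("utf-8", "latin-1")):
    """Return the file contents decoded with the first encoding that succeeds.

    Hypothetical helper: the encoding list is an assumption, not something
    confirmed for positive.txt / negative.txt.
    """
    for enc in encodings:
        try:
            # io.open accepts an explicit encoding on both Python 2 and 3.
            with io.open(path, "r", encoding=enc) as f:
                return f.read()
        except UnicodeDecodeError:
            # This encoding does not fit the bytes; try the next candidate.
            continue
    # Only reachable with a custom list: latin-1 itself accepts any byte.
    raise ValueError("could not decode %s with any of %r" % (path, encodings))
```

In the script, the reads of the two review files could then become something like `short_pos = read_text("positive.txt")` and `short_neg = read_text("negative.txt")` (variable names assumed). Because Latin-1 maps every byte to some character, the fallback always succeeds, so it is worth spot-checking a few decoded lines to confirm the accented characters look right.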
