Skip to content

apply mtmsn in DRCD chinese corpus #7

@allenyummy

Description

@allenyummy

I try to apply mtmsn in DRCD chinese corpus, and find out that "bert.tokenization.FullTokenizer" can't handle the chinese word tokenization. Is that why I can't use mtmsn in Chinese corpus ?

But I see the "bert.tokenization" can handle the chinese word, I dont know what the problem is.

Is there any help ?
Thanks.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions