Question: Why NLTK TweetTokenizer?

Thanks for your work on this nice project. 

I intend to create a library for text simplification, and potentially would like to integrate your package.
The selection of a tokenizer has an impact on the obtained readability scores and I was wondering how you approached this issue.

Was there any specific reason for choosing the Tweet-Tokenizer over e.g. the default/recommended Nltk-Tokenizer which better depicts the Penn Treebank's definition of word-boundaries? 
https://github.com/cdimascio/py-readability-metrics/blob/3ffb97f6057ae2451599d083a69ece78a61a6fa4/readability/text/analyzer.py#L128

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Question: Why NLTK TweetTokenizer? #26

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Uh oh!

Question: Why NLTK TweetTokenizer? #26

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions