The `FeatureGenerator` currently implements three separate steps: feature extraction (reading files, tokenization), postprocessing (case sensitivity, n-grams), and the bag-of-words model (feature counting, merging into a dataframe).
For some implementations (zeta, chunking, topic modelling?) we need the tokenized data before it has been merged into the bag-of-words model.
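
To make the idea concrete, here is a minimal sketch of what exposing the intermediate tokenized data could look like. It is not the existing `FeatureGenerator` code: the function names `tokenize`, `postprocess`, and `bag_of_words`, the regex tokenizer, and the plain dict of `Counter` objects standing in for the dataframe are all assumptions made for illustration.

```python
import re
from collections import Counter
from typing import Dict, Iterable, List


def tokenize(text: str) -> List[str]:
    """Feature extraction (hypothetical): split raw text into word tokens."""
    return re.findall(r"\w+", text)


def postprocess(tokens: Iterable[str], lower_case: bool = True, ngrams: int = 1) -> List[str]:
    """Postprocessing (hypothetical): optional case folding and n-gram construction."""
    toks = [t.lower() for t in tokens] if lower_case else list(tokens)
    if ngrams <= 1:
        return toks
    # Join each window of `ngrams` consecutive tokens into one feature.
    return [" ".join(toks[i:i + ngrams]) for i in range(len(toks) - ngrams + 1)]


def bag_of_words(docs: Dict[str, List[str]]) -> Dict[str, Counter]:
    """Bag-of-words step (hypothetical): count features per document.

    A dict of Counters stands in for the merged dataframe here.
    """
    return {name: Counter(tokens) for name, tokens in docs.items()}


# Example input; in practice the texts would be read from files.
texts = {"a.txt": "The cat sat. The cat slept.", "b.txt": "A dog barked."}

# The intermediate, ordered token lists stay available for zeta, chunking, etc.
tokenized = {name: postprocess(tokenize(text)) for name, text in texts.items()}

# The counting step is applied only when the bag-of-words model is needed.
counts = bag_of_words(tokenized)
```

With this split, consumers that need token order (such as chunking) can work on `tokenized` directly, while the existing bag-of-words consumers keep using `counts`.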