Skip to content

Releases: JohnSnowLabs/nlu

John Snow Labs NLU 1.0.3 trainable models, offline mode, sentence similarity

20 Nov 07:38
3f565eb
Compare
Choose a tag to compare

1.0.3 Release Notes

We are happy to announce NLU 1.0.3 has been released and comes with a lot new features, training classifiers, saving them and loading them offline, enabling running NLU with no internet connection, new notebooks and articles!

NLU 1.0.3 New Features

  • Train a Deep Learning classifier in 1 line! The popular ClassifierDL
    which can achieve state of the art results on any multi class text classification problem is now trainable!
    All it takes is just nlu.load('train.classifier).fit(dataset) . Your dataset can be a Pandas/Spark/Modin/Ray/Dask dataframe and needs to have a column named 'text" for text data and a column named 'y' for labels
  • Saving pipelines to HDD is now possible with nlu.save(path)
  • Loading pipelines from disk now possible with nlu.load(path=path).
  • NLU offline mode: Loading from disk makes running NLU offline now possible, since you can load pipelines/models from your local hard drive instead of John Snow Labs AWS servers.

NLU 1.0.3 New Notebooks and Tutorials

NLU 1.0.3 Bug fixes

  • Sentence Detector bugfix

nlu1.0.2

27 Oct 23:26
e6339a9
Compare
Choose a tag to compare

We are glad to announce nlu 1.0.2 is released!

NLU 1.0.2 Enhancements

  • More semantically concise output levels sentence and document enforced :
    • If a pipe is set to output_level='document' :
      • Every Sentence Embedding will generate 1 Embedding per Document/row in the input Dataframe, instead of 1 embedding per sentence.
      • Every Classifier will classify an entire Document/row
      • Each row in the output DF is a 1 to 1 mapping of the original input DF. 1 to 1 mapping from input to output.
    • If a pipe is set to output_level='sentence' :
      • Every Sentence Embedding will generate 1 Embedding per Sentence,
      • Every Classifier will classify exactly one sentence
      • Each row in the output DF can is mapped to one row in the input DF, but one row in the input DF can have multiple corresponding rows in the output DF. 1 to N mapping from input to output.
  • Improved generation of column names for classifiers. based on input nlu reference
  • Improved generation of column names for embeddings, based on input nlu reference
  • Improved automatic output level inference
  • Various test updates
  • Integration of CI pipeline with Github Actions

New Documentation is out!

Check it out here : http://nlu.johnsnowlabs.com/

nlu1.0.1

08 Oct 02:04
Compare
Choose a tag to compare
  • Fixed bug that caused NER pipelines to crash in NLU when input string caused the NER model to predict without additional metadata

nlu1.0.0

07 Oct 03:27
Compare
Choose a tag to compare

1.0 Release Notes

We are glad to announce that NLU 1.0.0 has been released!

Changes and Updates:

350+ NLP Models in 1 line of Python code with NLU 0.2.1

21 Sep 12:21
781fa58
Compare
Choose a tag to compare

We are glad to announce that NLU 0.2.1 has been released!
NLU makes the 350+ models and annotators in Spark NLPs arsenal available in just 1 line of python code and it works with Pandas dataframes!

What is included in NLU 0.2.1

  • 350+ pre-trained models
  • 100+ of the latest NLP word embeddings ( BERT, ELMO, ALBERT, XLNET, GLOVE, BIOBERT, ELECTRA, COVIDBERT) and different variations of them
  • 50+ of the latest NLP sentence embeddings ( BERT, ELECTRA, USE) and different variations of them
  • 50+ Classifiers
  • 50+ Supported languages
  • Named Entity Recognition (NER)
  • Part of Speech(POS)
  • Labeled and Unlabeled Dependency parsing
  • Spell Checking
  • Various Text Preprocessing and Cleaning algorithms

NLU References

NLU on Medium

NLU component examples

The following are Collab examples that showcase each NLU component and some applications.

Simple NLU Output Level Examples