John Snow Labs Spark NLP is a natural language processing library built on top of Apache Spark ML. It provides simple, performant, and accurate NLP annotations for machine learning pipelines that scale easily in a distributed environment.
Before you start, install anaconda-client and conda-build:
conda install anaconda-client conda-build
Make sure you are logged in as JohnSnowLabs (the login command is provided by anaconda-client):
anaconda login
Purge the previous builds:
conda build purge
Turn off auto-upload:
conda config --set anaconda_upload no
Build spark-nlp from the latest PyPI tar:
conda build . --python=3.6 && conda build . --python=3.7
Example of uploading the Conda package to Anaconda Cloud (replace version with the version you built):
anaconda upload /anaconda3/conda-bld/noarch/spark-nlp-version-py36_0.tar.bz2
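When several packages have been built, it can help to preview the upload commands before running them. The following is a minimal sketch, not part of the official release process: list_upload_commands is a hypothetical helper name, and the glob assumes the packages land under the noarch directory as in the example path above. It only prints the commands; nothing is uploaded.

```shell
# Hypothetical helper (not part of conda or anaconda-client):
# print one "anaconda upload" command per built spark-nlp package.
# $1 is the conda-bld directory; the noarch/ layout is an assumption
# taken from the example path above.
list_upload_commands() {
    bld_dir="$1"
    for pkg in "$bld_dir"/noarch/spark-nlp-*.tar.bz2; do
        # Skip the unexpanded glob when no package has been built yet.
        [ -e "$pkg" ] || continue
        echo "anaconda upload $pkg"
    done
}

# Dry run against the build directory used in the example above.
list_upload_commands /anaconda3/conda-bld
```

Review the printed commands and run the ones you want to publish. Adjust the directory argument if your conda-bld folder lives elsewhere.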
Install spark-nlp by using conda:
conda install -c johnsnowlabs spark-nlp
https://github.com/JohnSnowLabs/spark-nlp
Take a look at our official Spark NLP page at http://nlp.johnsnowlabs.com/ for user documentation and examples.
Apache License 2.0