200+ State of the Art Medical Models for NER, Entity Resolution, Relation Extraction, Assertion, Spark 3 and Python 3.8 support - John Snow Labs NLU 3.0.0

@C-K-Loan C-K-Loan released this 19 Apr 15:57
· 1017 commits to master since this release
c3dd901

200+ State of the Art Medical Models for NER, Entity Resolution, Relation Extraction, Assertion, Spark 3 and Python 3.8 support in NLU 3.0 Release and much more

We are incredibly excited to announce the release of NLU 3.0.0, which makes most of John Snow Labs' medical healthcare models available in just 1 line of code in NLU.
These models are the most accurate in their domains and highly scalable in Spark clusters.
In addition, Spark 3.0.X and Spark 3.1.X are now supported, together with Python 3.8.

This is enabled by the amazing Spark NLP 3.0.1 and Spark NLP for Healthcare 3.0.1 releases.
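As a rough illustration of the "1 line of code" claim, the sketch below wraps the nlu.load(...).predict(...) pattern. The reference med_ner.jsl is taken from the NERDLModels table in these notes. The helper name annotate and its loader parameter are hypothetical and exist only so the sketch can be exercised without a Spark session. The sketch also assumes the environment is already licensed for the healthcare models.

```python
def annotate(text, reference="med_ner.jsl", loader=None):
    """Load the pipeline behind an nlu.load() reference and run it on text.

    `annotate` and `loader` are hypothetical names for this sketch.
    `loader` defaults to nlu.load; pass a stand-in to try the function
    without NLU or Spark installed.
    """
    if loader is None:
        # Imported lazily: needs `pip install nlu pyspark==3.0.1`, and the
        # medical models additionally need a valid license (assumption).
        import nlu
        loader = nlu.load
    # The "one line": load a model by its reference, then predict.
    return loader(reference).predict(text)
```

With NLU installed and authorized, annotate("The patient was given 2 tablets of aspirin") would load the ner_jsl model and return its predictions for that sentence.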

New Features

  • Over 200 new models for the healthcare domain
  • 6 new classes of models: Assertion, Sentence Resolvers, Chunk Resolvers, Relation Extractors, Medical NER models, De-Identification models
  • Spark 3.0.X and 3.1.X support
  • Python 3.8 Support
  • New output level: relation
  • 1 line to install NLU: just run !wget https://raw.githubusercontent.com/JohnSnowLabs/nlu/master/scripts/colab_setup.sh -O - | bash
  • Various new EMR and Databricks versions supported
  • GPU Mode, more than 600% speedup by enabling GPU mode.
  • Authorized mode for licensed features

New Documentation

New Notebooks

AssertionDLModels

Language nlu.load() reference Spark NLP Model reference
English assert assertion_dl
English assert.biobert assertion_dl_biobert
English assert.healthcare assertion_dl_healthcare
English assert.large assertion_dl_large

New Word Embeddings

Language nlu.load() reference Spark NLP Model reference
English embed.glove.clinical embeddings_clinical
English embed.glove.biovec embeddings_biovec
English embed.glove.healthcare embeddings_healthcare
English embed.glove.healthcare_100d embeddings_healthcare_100d
English en.embed.glove.icdoem embeddings_icdoem
English en.embed.glove.icdoem_2ng embeddings_icdoem_2ng

Sentence Entity resolvers

Language nlu.load() reference Spark NLP Model reference
English embed_sentence.biobert.mli sbiobert_base_cased_mli
English resolve sbiobertresolve_cpt
English resolve.cpt sbiobertresolve_cpt
English resolve.cpt.augmented sbiobertresolve_cpt_augmented
English resolve.cpt.procedures_augmented sbiobertresolve_cpt_procedures_augmented
English resolve.hcc.augmented sbiobertresolve_hcc_augmented
English resolve.icd10cm sbiobertresolve_icd10cm
English resolve.icd10cm.augmented sbiobertresolve_icd10cm_augmented
English resolve.icd10cm.augmented_billable sbiobertresolve_icd10cm_augmented_billable_hcc
English resolve.icd10pcs sbiobertresolve_icd10pcs
English resolve.icdo sbiobertresolve_icdo
English resolve.rxcui sbiobertresolve_rxcui
English resolve.rxnorm sbiobertresolve_rxnorm
English resolve.snomed sbiobertresolve_snomed_auxConcepts
English resolve.snomed.aux_concepts sbiobertresolve_snomed_auxConcepts
English resolve.snomed.aux_concepts_int sbiobertresolve_snomed_auxConcepts_int
English resolve.snomed.findings sbiobertresolve_snomed_findings
English resolve.snomed.findings_int sbiobertresolve_snomed_findings_int

RelationExtractionModel

Language nlu.load() reference Spark NLP Model reference
English relation.posology posology_re
English relation redl_bodypart_direction_biobert
English relation.bodypart.direction redl_bodypart_direction_biobert
English relation.bodypart.problem redl_bodypart_problem_biobert
English relation.bodypart.procedure redl_bodypart_procedure_test_biobert
English relation.chemprot redl_chemprot_biobert
English relation.clinical redl_clinical_biobert
English relation.date redl_date_clinical_biobert
English relation.drug_drug_interaction redl_drug_drug_interaction_biobert
English relation.humen_phenotype_gene redl_human_phenotype_gene_biobert
English relation.temporal_events redl_temporal_events_biobert

NERDLModels

Language nlu.load() reference Spark NLP Model reference
English med_ner.ade.clinical ner_ade_clinical
English med_ner.ade.clinical_bert ner_ade_clinicalbert
English med_ner.ade.ade_healthcare ner_ade_healthcare
English med_ner.anatomy ner_anatomy
English med_ner.anatomy.biobert ner_anatomy_biobert
English med_ner.anatomy.coarse ner_anatomy_coarse
English med_ner.anatomy.coarse_biobert ner_anatomy_coarse_biobert
English med_ner.aspect_sentiment ner_aspect_based_sentiment
English med_ner.bacterial_species ner_bacterial_species
English med_ner.bionlp ner_bionlp
English med_ner.bionlp.biobert ner_bionlp_biobert
English med_ner.cancer ner_cancer_genetics
English med_ner.cellular ner_cellular
English med_ner.cellular.biobert ner_cellular_biobert
English med_ner.chemicals ner_chemicals
English med_ner.chemprot ner_chemprot_biobert
English med_ner.chemprot.clinical ner_chemprot_clinical
English med_ner.clinical ner_clinical
English med_ner.clinical.biobert ner_clinical_biobert
English med_ner.clinical.noncontrib ner_clinical_noncontrib
English med_ner.diseases ner_diseases
English med_ner.diseases.biobert ner_diseases_biobert
English med_ner.diseases.large ner_diseases_large
English med_ner.drugs ner_drugs
English med_ner.drugsgreedy ner_drugs_greedy
English med_ner.drugs.large ner_drugs_large
English med_ner.events_biobert ner_events_biobert
English med_ner.events_clinical ner_events_clinical
English med_ner.events_healthcre ner_events_healthcare
English med_ner.financial_contract ner_financial_contract
English med_ner.healthcare ner_healthcare
English med_ner.human_phenotype.gene_biobert ner_human_phenotype_gene_biobert
English med_ner.human_phenotype.gene_clinical ner_human_phenotype_gene_clinical
English med_ner.human_phenotype.go_biobert ner_human_phenotype_go_biobert
English med_ner.human_phenotype.go_clinical ner_human_phenotype_go_clinical
English med_ner.jsl ner_jsl
English med_ner.jsl.biobert ner_jsl_biobert
English med_ner.jsl.enriched ner_jsl_enriched
English med_ner.jsl.enriched_biobert ner_jsl_enriched_biobert
English med_ner.measurements ner_measurements_clinical
English med_ner.medmentions ner_medmentions_coarse
English med_ner.posology ner_posology
English med_ner.posology.biobert ner_posology_biobert
English med_ner.posology.greedy ner_posology_greedy
English med_ner.posology.healthcare ner_posology_healthcare
English med_ner.posology.large ner_posology_large
English med_ner.posology.large_biobert ner_posology_large_biobert
English med_ner.posology.small ner_posology_small
English med_ner.radiology ner_radiology
English med_ner.radiology.wip_clinical ner_radiology_wip_clinical
English med_ner.risk_factors ner_risk_factors
English med_ner.risk_factors.biobert ner_risk_factors_biobert
English med_ner.i2b2 nerdl_i2b2
English med_ner.tumour nerdl_tumour_demo
English med_ner.jsl.wip.clinical jsl_ner_wip_clinical
English med_ner.jsl.wip.clinical.greedy jsl_ner_wip_greedy_clinical
English med_ner.jsl.wip.clinical.modifier jsl_ner_wip_modifier_clinical
English med_ner.jsl.wip.clinical.rd jsl_rd_ner_wip_greedy_clinical

De-Identification Models

Language nlu.load() reference Spark NLP Model reference
English med_ner.deid.augmented ner_deid_augmented
English med_ner.deid.biobert ner_deid_biobert
English med_ner.deid.enriched ner_deid_enriched
English med_ner.deid.enriched_biobert ner_deid_enriched_biobert
English med_ner.deid.large ner_deid_large
English med_ner.deid.sd ner_deid_sd
English med_ner.deid.sd_large ner_deid_sd_large
English med_ner.deid nerdl_deid
English med_ner.deid.synthetic ner_deid_synthetic
English med_ner.deid.dl ner_deidentify_dl
English en.de_identify deidentify_rb
English de_identify.rules deid_rules
English de_identify.clinical deidentify_enriched_clinical
English de_identify.large deidentify_large
English de_identify.rb deidentify_rb
English de_identify.rb_no_regex deidentify_rb_no_regex

Chunk resolvers

Language nlu.load() reference Spark NLP Model reference
English resolve_chunk.athena_conditions chunkresolve_athena_conditions_healthcare
English resolve_chunk.cpt_clinical chunkresolve_cpt_clinical
English resolve_chunk.icd10cm.clinical chunkresolve_icd10cm_clinical
English resolve_chunk.icd10cm.diseases_clinical chunkresolve_icd10cm_diseases_clinical
English resolve_chunk.icd10cm.hcc_clinical chunkresolve_icd10cm_hcc_clinical
English resolve_chunk.icd10cm.hcc_healthcare chunkresolve_icd10cm_hcc_healthcare
English resolve_chunk.icd10cm.injuries chunkresolve_icd10cm_injuries_clinical
English resolve_chunk.icd10cm.musculoskeletal chunkresolve_icd10cm_musculoskeletal_clinical
English resolve_chunk.icd10cm.neoplasms chunkresolve_icd10cm_neoplasms_clinical
English resolve_chunk.icd10cm.poison chunkresolve_icd10cm_poison_ext_clinical
English resolve_chunk.icd10cm.puerile chunkresolve_icd10cm_puerile_clinical
English resolve_chunk.icd10pcs.clinical chunkresolve_icd10pcs_clinical
English resolve_chunk.icdo.clinical chunkresolve_icdo_clinical
English resolve_chunk.loinc chunkresolve_loinc_clinical
English resolve_chunk.rxnorm.cd chunkresolve_rxnorm_cd_clinical
English resolve_chunk.rxnorm.in chunkresolve_rxnorm_in_clinical
English resolve_chunk.rxnorm.in_healthcare chunkresolve_rxnorm_in_healthcare
English resolve_chunk.rxnorm.sbd chunkresolve_rxnorm_sbd_clinical
English resolve_chunk.rxnorm.scd chunkresolve_rxnorm_scd_clinical
English resolve_chunk.rxnorm.scdc chunkresolve_rxnorm_scdc_clinical
English resolve_chunk.rxnorm.scdc_healthcare chunkresolve_rxnorm_scdc_healthcare
English resolve_chunk.rxnorm.xsmall.clinical chunkresolve_rxnorm_xsmall_clinical
English resolve_chunk.snomed.findings chunkresolve_snomed_findings_clinical

New Classifiers

Language nlu.load() reference Spark NLP Model reference
English classify.icd10.clinical classifier_icd10cm_hcc_clinical
English classify.icd10.healthcare classifier_icd10cm_hcc_healthcare
English classify.ade.biobert classifierdl_ade_biobert
English classify.ade.clinical classifierdl_ade_clinicalbert
English classify.ade.conversational classifierdl_ade_conversational_biobert
English classify.gender.biobert classifierdl_gender_biobert
English classify.gender.sbert classifierdl_gender_sbert
English classify.pico classifierdl_pico_biobert

German Medical models

nlu.load() reference Spark NLP Model reference
embed w2v_cc_300d
embed.w2v w2v_cc_300d
resolve_chunk chunkresolve_ICD10GM
resolve_chunk.icd10gm chunkresolve_ICD10GM
resolve_chunk.icd10gm.2021 chunkresolve_ICD10GM_2021
med_ner.legal ner_legal
med_ner ner_healthcare
med_ner.healthcare ner_healthcare
med_ner.healthcare_slim ner_healthcare_slim
med_ner.traffic ner_traffic

Spanish Medical models

nlu.load() reference Spark NLP Model reference
embed.scielo.150d embeddings_scielo_150d
embed.scielo.300d embeddings_scielo_300d
embed.scielo.50d embeddings_scielo_50d
embed.scielowiki.150d embeddings_scielowiki_150d
embed.scielowiki.300d embeddings_scielowiki_300d
embed.scielowiki.50d embeddings_scielowiki_50d
embed.sciwiki.150d embeddings_sciwiki_150d
embed.sciwiki.300d embeddings_sciwiki_300d
embed.sciwiki.50d embeddings_sciwiki_50d
med_ner ner_diag_proc
med_ner.neoplasm ner_neoplasms
med_ner.diag_proc ner_diag_proc

GPU Mode

You can now enable NLU GPU mode by setting gpu=True while loading a model, e.g. nlu.load('train.sentiment', gpu=True). You must restart your kernel if you have already loaded an NLU pipeline without GPU mode.
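A minimal sketch of enabling GPU mode at load time follows. The gpu flag and the train.sentiment reference come from the paragraph above. The wrapper load_with_gpu and its loader parameter are hypothetical and are included only so the call can be tried without a GPU or Spark session.

```python
def load_with_gpu(reference, loader=None):
    """Load an NLU pipeline with GPU mode switched on.

    `load_with_gpu` and `loader` are hypothetical names for this sketch;
    `loader` defaults to nlu.load.
    """
    if loader is None:
        import nlu  # imported lazily; requires NLU and a GPU-capable setup
        loader = nlu.load
    # gpu=True is the keyword argument named in these release notes.
    return loader(reference, gpu=True)
```

Because the flag is applied when the pipeline is loaded, calling this in a kernel that already loaded a pipeline without GPU mode would still need a kernel restart first.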

Output Level Relation

This new output level is used for relation extractors and will give you 1 row per relation extracted.
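To make "1 row per relation" concrete, the following sketch flattens a per-document list of relations into one row per relation. It is not NLU's implementation: the function name, the dictionary keys, and the sample data are all made up for illustration. In NLU itself the level is presumably selected through the output_level argument of predict (assumed here to accept 'relation').

```python
def one_row_per_relation(documents):
    """Flatten per-document relation lists into one row per relation.

    Illustrative only: the keys below are hypothetical and do not mirror
    NLU's actual output columns.
    """
    rows = []
    for doc_id, doc in enumerate(documents):
        for rel in doc["relations"]:
            rows.append({
                "document_id": doc_id,
                "text": doc["text"],
                "relation": rel["label"],
                "entity1": rel["entity1"],
                "entity2": rel["entity2"],
            })
    return rows


# Hand-written sample data, not real model output.
docs = [
    {
        "text": "Take 2 tablets of aspirin daily.",
        "relations": [
            {"label": "DRUG-DOSAGE", "entity1": "aspirin", "entity2": "2 tablets"},
            {"label": "DRUG-FREQUENCY", "entity1": "aspirin", "entity2": "daily"},
        ],
    },
    {"text": "No medication prescribed.", "relations": []},
]

rows = one_row_per_relation(docs)
for row in rows:
    print(row["document_id"], row["relation"], row["entity1"], row["entity2"])
```

The first document yields two rows because it contains two relations, while the second contributes none. This differs from a document-level output, which would keep one row per input text.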

Bug fixes

  • Fixed a bug that caused loading NLU models in offline mode to fail on some occasions

1 line Install NLU

!wget https://raw.githubusercontent.com/JohnSnowLabs/nlu/master/scripts/colab_setup.sh -O - | bash

Install via PIP

! pip install nlu pyspark==3.0.1
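After installing, it can be useful to confirm which versions ended up in the environment. The sketch below uses only the Python standard library (importlib.metadata, available from Python 3.8). The helper names are hypothetical, and the Spark check only tests for the 3.0.X and 3.1.X lines that this release adds support for; it does not claim other versions are unsupported.

```python
from importlib import metadata  # standard library from Python 3.8 onwards


def installed_version(package):
    """Return the installed version string of a package, or None if absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def is_spark_3_0_or_3_1(version):
    """True if a version string belongs to the Spark 3.0.X or 3.1.X line."""
    parts = version.split(".")
    return len(parts) >= 2 and (parts[0], parts[1]) in {("3", "0"), ("3", "1")}


for name in ("nlu", "pyspark"):
    print(name, installed_version(name))
```

For the pip command above, is_spark_3_0_or_3_1 would accept the pinned pyspark version 3.0.1.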

Additional NLU resources