Skip to content
You must be logged in to sponsor urduhack

Become a sponsor to UrduHack

Urduhack: A Python NLP library for the Urdu language

Urduhack is an NLP library for the Urdu language. It comes with a lot of battery included features to help you process Urdu data in the easiest way possible.

Our Goal

  • Academic users Easier experimentation to prove their hypothesis without coding from scratch.
  • NLP beginners Learn how to build an NLP project with production-level code quality.
  • NLP developers Build a production-level application within minutes.

πŸ”₯ Features Support

  • [x] Normalization
  • [x] Tokenization
  • [x] Preprocessing
  • [x] Pipeline Module
  • [x] Models
    • [x] Pos tagger
    • [ ] Sentimental analysis
    • [ ] Sentence classification
    • [ ] Documents classification
    • [ ] Name entity recognition
    • [ ] Image to text
    • [ ] Speech to text
  • [x] Datasets loader

πŸ›  Installation

Urduhack officially supports Python 3.6–3.7 and runs great on PyPy.

Installing with tensorflow cpu version.

$ pip install urduhack[tf]

Installing with tensorflow gpu version.

$ pip install urduhack[tf-gpu]

Usage

import urduhack

# Downloading models
urduhack.download()

nlp = urduhack.Pipeline()
text = ""
doc = nlp(text)

for sentence in doc.sentences:
    print(sentence.text)
    for word in sentence.words:
        print(word)

πŸ”— Documentation

Fantastic documentation is available at https://urduhack.readthedocs.io/

Documentation
Installation How to install Urduhack and download models
Quickstart New to Urduhack? Here's everything you need to know!
API Reference The detailed reference for Urduhack's API.

How to Contribute

  1. Check for open issues or open a fresh issue to start a discussion
    around a feature idea or a bug. There is a Contributor Friendly
    tag for issues that should be ideal for people who are not very
    familiar with the codebase yet.
  2. Write a test which shows that the bug was fixed or that the feature
    works as expected.
  3. Send a pull request and bug the maintainer until it gets merged and
    published. :)

πŸ‘ Contributors

Special thanks to everyone who contributed to getting the UrduHack to the current state.

@urduhack

The goal is to maintain GPU Machines, Hosting and Data labelling team cost. Achieving this goal by contributions from sponsorship will help me to accomplish NLP goals for the Urdu language.

Featured work

  1. urduhack/urduhack

    An NLP library for the Urdu language. It comes with a lot of battery included features to help you process Urdu data in the easiest way possible.

    Python 283

0% towards $100 per month goal

Be the first to sponsor this goal!

Select a tier

$ one time

Choose a custom amount.