Po AD2000X

Computational Linguist
London, UK
Stars

Data

Dataset, Converter, Text, Augmentation, Tagging

20 repositories

kothiyayogesh / medium-article-code

Jupyter Notebook · 42 stars · 30 forks · Updated May 15, 2019

juand-r / entity-recognition-datasets

A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.

Python · 1,527 stars · 247 forks · Updated Nov 29, 2024

facebookresearch / AugLy

A data augmentations library for audio, image, text, and video.

Python · 4,990 stars · 303 forks · Updated Feb 28, 2025

DS4SD / docling

Get your documents ready for gen AI

Python · 23,200 stars · 1,340 forks · Updated Mar 3, 2025

signal-ai / Signal-1M-Tools

Python · 49 stars · 17 forks · Updated Sep 3, 2019

williamgilpin / dysts

Hundreds of strange attractors

Python · 439 stars · 69 forks · Updated Feb 10, 2025

CVHub520 / X-AnyLabeling

Effortless data labeling with AI support from Segment Anything and other awesome models.

Python · 4,934 stars · 558 forks · Updated Feb 26, 2025

voxel51 / zcore

Shell · 66 stars · 6 forks · Updated Jan 13, 2025

voxel51 / fiftyone

Refine high-quality datasets and visual AI models

Python · 9,245 stars · 605 forks · Updated Mar 3, 2025

HumanSignal / label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

JavaScript · 20,974 stars · 2,578 forks · Updated Mar 3, 2025

crate / academy-fundamentals-course

GitHub repository accompanying the CrateDB Fundamentals Course at the CrateDB Academy.

Jupyter Notebook · 6 stars · 3 forks · Updated Mar 3, 2025

crate / crate

CrateDB is a distributed and scalable SQL database for storing and analyzing massive amounts of data in near real-time, even with complex queries. It is PostgreSQL-compatible, and based on Lucene.

Java · 4,192 stars · 575 forks · Updated Mar 3, 2025