Skip to content
View henryre's full-sized avatar
  • San Francisco, CA

Organizations

@HazyResearch @stanford-futuredata @snorkel-team

Block or report henryre

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Resources for Data Centric AI

TeX 1,108 117 Updated Dec 13, 2023

A collection of utilities for writing labeling functions, transformation functions, and slicing functions.

Python 20 7 Updated Apr 22, 2020

The Big List of Naughty Strings is a list of strings which have a high probability of causing issues when used as user-input data.

Python 46,980 2,152 Updated Apr 18, 2024

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Python 81,497 7,014 Updated Mar 1, 2025

A collection of tutorials for Snorkel

Python 394 181 Updated Nov 20, 2024

Ultimate Plumber is a tool for writing Linux pipes with instant live preview

Go 8,438 129 Updated Sep 5, 2024

Hyperparameter Experiments with TensorFlow and Keras

Python 1,630 268 Updated Apr 22, 2024

Snorkel MeTaL: A framework for training models with multi-task weak supervision

Python 423 79 Updated Sep 16, 2019

A library for efficient similarity search and clustering of dense vectors.

C++ 33,332 3,772 Updated Mar 1, 2025

A library for Multilingual Unsupervised or Supervised word Embeddings

Python 3,205 557 Updated Aug 31, 2022

DyNet: The Dynamic Neural Network Toolkit

C++ 3,426 703 Updated Dec 1, 2023

Open standard for machine learning interoperability

Python 18,515 3,722 Updated Mar 1, 2025

Caffe2 is a lightweight, modular, and scalable deep learning framework.

Shell 8,419 1,938 Updated Feb 7, 2023

An open-source C++ library developed and used at Facebook.

C++ 29,028 5,640 Updated Mar 1, 2025

An open-source NLP research library, built on PyTorch.

Python 11,818 2,251 Updated Nov 22, 2022

Learning to Compose Domain-Specific Transformations for Data Augmentation

Python 172 29 Updated Nov 21, 2022

Super simple fit method for PyTorch Modules

Python 101 18 Updated Jun 24, 2020

Deep Learning for humans

Python 62,634 19,527 Updated Feb 28, 2025

Programming exercises for the Stanford Unsupervised Feature Learning and Deep Learning Tutorial

2,608 1,597 Updated May 12, 2021

Data and code behind the articles and graphics at FiveThirtyEight

Jupyter Notebook 16,931 10,934 Updated Feb 25, 2025

A brazen two-column theme for Jekyll.

CSS 3,709 4,000 Updated Jul 3, 2024

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 35,719 6,050 Updated Mar 1, 2025

Some notes on things I find interesting and important.

JavaScript 1,979 178 Updated Feb 10, 2025

Collection of tools for building diachronic/historical word vectors

Python 423 92 Updated Dec 18, 2023

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 87,397 23,483 Updated Mar 1, 2025

Semantic Parser with Execution

Java 833 299 Updated May 1, 2023

MacroBase: A Search Engine for Fast Data

Java 664 126 Updated Dec 14, 2022

High-performance runtime for data analytics applications

Rust 2,996 256 Updated Jun 22, 2022

Shallow baseline models for text in TensorFlow

Python 11 4 Updated Jul 1, 2017

A probabilistic programming language in TensorFlow. Deep generative models, variational inference.

Jupyter Notebook 4,833 757 Updated Mar 18, 2024
Next
Showing results