TabDPT: Scaling Tabular Foundation Models

Installation

To run TabDPT, install the following packages:

pytorch
numpy
scikit-learn
faiss
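
Before running the examples, it can help to confirm that the packages above are importable. The following is a minimal sketch, not part of the repository: the helper name `missing_packages` is hypothetical, and the mapping only records the usual import names (`torch` for pytorch, `sklearn` for scikit-learn).

```python
from importlib.util import find_spec

# Package name as listed above -> name used in an `import` statement.
REQUIRED_MODULES = {
    "pytorch": "torch",
    "numpy": "numpy",
    "scikit-learn": "sklearn",
    "faiss": "faiss",
}


def missing_packages(required=REQUIRED_MODULES):
    """Return the listed package names whose import module cannot be found."""
    return [pkg for pkg, module in required.items() if find_spec(module) is None]


if __name__ == "__main__":
    missing = missing_packages()
    print("Missing packages:", ", ".join(missing) if missing else "none")
```

The check only tests that each module can be located; it does not verify versions or GPU support.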

Update December 2024

Added support for flash attention (with bf16 precision) and a compile flag. Both default to True and should provide a significant speed-up.
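
If the defaults do not suit your hardware, the flags can be set explicitly. The sketch below is illustrative only: `pick_speed_flags` is a hypothetical helper, and it rests on two assumptions that are not stated in this README, namely that bf16 flash attention needs a CUDA GPU with bf16 support and that compilation is mainly worthwhile on a GPU. Only the `use_flash` and `compile` keyword names come from the usage example in this README.

```python
def pick_speed_flags(has_cuda, bf16_supported):
    """Hypothetical helper: choose values for the use_flash and compile flags."""
    # Assumption: flash attention in bf16 needs a CUDA GPU that supports bf16.
    use_flash = bool(has_cuda and bf16_supported)
    # Assumption: compilation is only worth its warm-up cost on a GPU.
    compile_model = bool(has_cuda)
    return {"use_flash": use_flash, "compile": compile_model}


# Example wiring (requires PyTorch and the tabdpt module from this repository):
#   import torch
#   from tabdpt import TabDPTClassifier
#   has_cuda = torch.cuda.is_available()
#   flags = pick_speed_flags(has_cuda, has_cuda and torch.cuda.is_bf16_supported())
#   model = TabDPTClassifier(path='checkpoints/tabdpt_76M.ckpt', **flags)
```

Passing `use_flash=False, compile=False` directly is the simplest fallback if either feature causes problems.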

Update January 2025

Weights are now stored with Git LFS at the path checkpoints/tabdpt_76M.ckpt instead of on Google Drive.
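
A clone made without Git LFS leaves a small text pointer file in place of the real weights. The sketch below shows one way to detect that case before loading the model; `is_lfs_pointer` and `check_checkpoint` are hypothetical helper names, and the check relies only on the fact that Git LFS pointer files begin with a fixed `version https://git-lfs.github.com/spec/v1` line.

```python
from pathlib import Path

# Every Git LFS pointer file starts with this line.
LFS_POINTER_PREFIX = b"version https://git-lfs.github.com/spec/v1"


def is_lfs_pointer(path):
    """Return True if the file looks like a Git LFS pointer, not real weights."""
    with open(path, "rb") as f:
        head = f.read(len(LFS_POINTER_PREFIX))
    return head == LFS_POINTER_PREFIX


def check_checkpoint(path="checkpoints/tabdpt_76M.ckpt"):
    """Return 'missing', 'lfs-pointer', or 'ok' for the checkpoint file."""
    p = Path(path)
    if not p.is_file():
        return "missing"
    if is_lfs_pointer(p):
        return "lfs-pointer"
    return "ok"
```

If the result is `lfs-pointer`, installing Git LFS and running `git lfs pull` inside the repository should download the actual checkpoint.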

Example Usage 1

from sklearn.metrics import accuracy_score
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from tabdpt import TabDPTClassifier

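# Load a binary classification dataset and hold out a third of it for testing.
X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.33, random_state=42)

# Load the pretrained checkpoint with flash attention and compilation enabled.
model = TabDPTClassifier(path='checkpoints/tabdpt_76M.ckpt', use_flash=True, compile=True)
model.fit(X_train, y_train)
# temperature and context_size are chosen at prediction time.
y_pred = model.predict(X_test, temperature=0.8, context_size=1024)
print(accuracy_score(y_test, y_pred))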
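
Because `context_size` is passed to `predict` rather than to `fit`, different values can be compared without refitting. The following is a minimal sketch of such a comparison, assuming an already fitted classifier whose `predict` accepts the `temperature` and `context_size` keywords shown above; `sweep_context_sizes` is a hypothetical helper, and accuracy is computed by hand to keep the block dependency-free.

```python
def sweep_context_sizes(model, X_val, y_val, context_sizes, temperature=0.8):
    """Return {context_size: accuracy} for an already fitted classifier."""
    results = {}
    for size in context_sizes:
        y_pred = model.predict(X_val, temperature=temperature, context_size=size)
        # Fraction of predictions that match the true labels.
        correct = sum(int(p == t) for p, t in zip(y_pred, y_val))
        results[size] = correct / len(y_val)
    return results


# Example wiring, reusing the fitted `model` from the example above:
#   scores = sweep_context_sizes(model, X_val, y_val, [256, 512, 1024])
#   best_size = max(scores, key=scores.get)
```

Run the sweep on a validation split carved out of the training data rather than on the test set, so that the reported test accuracy stays unbiased.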

Example Usage 2

from sklearn.datasets import fetch_california_housing
from sklearn.metrics import r2_score
from sklearn.model_selection import train_test_split
from tabdpt import TabDPTRegressor

# Load a regression dataset and hold out a third of it for testing.
X, y = fetch_california_housing(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.33, random_state=42)

# The regressor loads the same checkpoint file as the classifier.
model = TabDPTRegressor(path='checkpoints/tabdpt_76M.ckpt')
model.fit(X_train, y_train)
y_pred = model.predict(X_test, context_size=1024)
print(r2_score(y_test, y_pred))
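
For readers unfamiliar with the metric printed above, R² is 1 minus the ratio of the residual sum of squares to the total sum of squares around the mean of the true targets. The sketch below is a plain-Python illustration of that formula, not scikit-learn's implementation; the function name `r2` is hypothetical, and it assumes the true targets are not all identical (otherwise the denominator is zero).

```python
def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    y_true = [float(v) for v in y_true]
    y_pred = [float(v) for v in y_pred]
    mean = sum(y_true) / len(y_true)
    # Squared error of the predictions.
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    # Squared error of always predicting the mean of the true targets.
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot
```

A score of 1.0 means perfect predictions, 0.0 matches a model that always predicts the mean, and negative values are worse than that baseline.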

Roadmap

Release other model sizes
Release training code
