
SoftwareImpacts/SIMPAC-2021-182

 
 


Data Science Toolkit


Data Science Toolkit (DST) is a Python library that makes it easier to implement data science projects.

Simple Demo

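# DataFrame and Model are DST classes; see the DST documentation for import paths
# load the iris dataset and separate the target column from the features
data = DataFrame()
data.load_dataset('iris')
y = data.get_column('target')
data.drop_column('target')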

# decision tree model
model = Model(data_x=data.get_dataframe(), data_y=y, model_type='dt', training_percent=0.8)

# train the model
model.train()

# get all classification evaluation metrics
model.report()

# run 5-fold cross-validation
model.cross_validation(5)
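
For readers unfamiliar with the last step of the demo, the following is a minimal, library-independent sketch of how k-fold splitting works in general. It is not DST's implementation; the helper names `k_fold_indices` and `train_test_splits` are hypothetical and exist only for illustration.

```python
# Generic k-fold splitting sketch (an assumption for illustration, not DST code).

def k_fold_indices(n_samples, k):
    """Split range(n_samples) into k contiguous folds of near-equal size."""
    if k < 2 or k > n_samples:
        raise ValueError("k must be between 2 and n_samples")
    base, extra = divmod(n_samples, k)
    folds = []
    start = 0
    for i in range(k):
        # The first `extra` folds receive one additional sample.
        size = base + (1 if i < extra else 0)
        folds.append(list(range(start, start + size)))
        start += size
    return folds


def train_test_splits(n_samples, k):
    """Yield (train_indices, test_indices) pairs, one per fold."""
    folds = k_fold_indices(n_samples, k)
    for i, test in enumerate(folds):
        # Every fold except the i-th one is used for training.
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test


if __name__ == "__main__":
    # The iris dataset has 150 samples; with k=5 each fold holds 30 of them.
    for train, test in train_test_splits(150, 5):
        print(len(train), len(test))
```

Each sample lands in the test set exactly once, so the k scores together cover the whole dataset. In practice the indices are usually shuffled or stratified first, which matters for iris because its rows are ordered by class.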

Documentation

More information can be found on the DST documentation site.

Contributing

Contributions and suggestions are welcome via GitHub pull requests.

Maintainership

We are actively enhancing the repository with new algorithms.

How to cite
