Decision-tree-classifier

A decision tree is a tree-based supervised learning method used to predict the value of a target variable.
Imagine you want to predict whether it is going to snow tomorrow, so that you can decide whether to wake up early to shovel the snow. To predict this event, you use historical weather data, and here is the related scatter plot for the given data.

As you can see in the plot, all the red data points (the snow instances) occur when the temperature is below 30 and the humidity level is above 70. If we translate this scatter plot into a decision tree diagram, it looks like the one below:

For this snow dataset, you can perfectly predict whether it is going to snow by asking two questions:

Is Temperature below 30?
Is Humidity above 70?
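
To make the two questions concrete, here is a minimal sketch of the same tree written by hand as nested conditions. The function name `will_snow`, the order in which the questions are asked, and the sample inputs are assumptions for illustration; only the thresholds 30 and 70 come from the description above.

```python
def will_snow(temperature: float, humidity: float) -> bool:
    """Hand-written version of the two-question tree (illustrative only)."""
    if temperature < 30:      # Question 1: Is Temperature below 30?
        if humidity > 70:     # Question 2: Is Humidity above 70?
            return True       # Both answers are "yes": predict snow
    return False              # Any "no" answer: predict no snow


# Hypothetical inputs, not taken from the dataset in the plot
print(will_snow(25, 80))  # True
print(will_snow(25, 50))  # False
print(will_snow(40, 90))  # False
```

A decision tree classifier learns this kind of nested rule from the data instead of having it written by hand.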

Simple implementation

# Decision Tree Classifier Implementation using Sklearn
# Step 1: Load the data
from sklearn import datasets
iris = datasets.load_iris()
X = iris.data
y = iris.target

# Step 2: Split the data into training (80%) and test (20%) sets
from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)

# Step 3: Train the model and predict on the test set
from sklearn import tree
clf = tree.DecisionTreeClassifier()
clf = clf.fit(X_train, y_train)
pred = clf.predict(X_test)

# Step 4: Evaluate the model performance using the test set
from sklearn.metrics import accuracy_score
acc = accuracy_score(y_test, pred)
print('accuracy rate', acc)
# accuracy rate 1.0

# Step 5: Render the decision tree
# Requires the graphviz Python package and the Graphviz executables on PATH
import graphviz
dot_data = tree.export_graphviz(clf, out_file=None,
                                feature_names=iris.feature_names,
                                class_names=iris.target_names,
                                filled=True, rounded=True,
                                special_characters=True)
graph = graphviz.Source(dot_data)
graph.render("iris classification")  # writes the tree to a PDF file
# 'iris classification.pdf'

# In a Jupyter notebook, evaluating the Source object as the last
# expression of a cell displays the tree inline
graph
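
If Graphviz is not installed, the learned rules can also be inspected as plain text. The following is a minimal sketch using scikit-learn's `tree.export_text`; it retrains the classifier so that it runs on its own, and `random_state=0` on the classifier is an assumption added here only to make the printed tree reproducible.

```python
from sklearn import datasets, tree
from sklearn.model_selection import train_test_split

# Same data and split as in the steps above
iris = datasets.load_iris()
X_train, X_test, y_train, y_test = train_test_split(
    iris.data, iris.target, test_size=0.2, random_state=42)

# random_state=0 is an assumption, added for a reproducible tree
clf = tree.DecisionTreeClassifier(random_state=0).fit(X_train, y_train)

# Each line of the output is one question (split) or one leaf (class)
rules = tree.export_text(clf, feature_names=list(iris.feature_names))
print(rules)
```

Each indented line is a threshold question on one feature, which is the same structure as the two snow questions at the top of this README.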

PyMachine-Collective/Decision-tree-classifier
