KNN Undersampling

This project contains implementations of the KNN Undersampling method in several languages, as proposed in the original paper.

Abstract

In supervised learning, an imbalanced number of instances among the classes in a dataset can cause algorithms to classify an instance from the minority class as one from the majority class. To address this problem, the KNN algorithm provides a basis for other balancing methods. These balancing methods are revisited in this work, and a new and simple KNN undersampling approach is proposed. The experiments demonstrated that the KNN undersampling method outperformed other sampling methods. The proposed method also outperformed the results of other studies, indicating that the simplicity of KNN can be used as a base for efficient algorithms in machine learning and knowledge discovery.

For more details about KNN Undersampling, see: Beckmann, M., Ebecken, N.F.F., de Lima, B.S.L.B.P. (2015). A KNN Undersampling Approach for Data Balancing.
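To give a feel for the idea, here is a minimal Python sketch. It is not the code from this repository and it is not taken from the paper. It assumes the following rule: an instance of the majority class is removed when at least `t` of its `k` nearest neighbours (Euclidean distance) belong to another class. The function name `knn_undersample` and the parameter names `k` and `t` are hypothetical, chosen only for illustration; check the paper and the files in this repository for the actual method and its defaults.

```python
import math
from collections import Counter


def knn_undersample(X, y, k=3, t=1):
    """Illustrative sketch only (assumed rule, hypothetical names).

    Drops every majority-class row that has at least `t` rows of another
    class among its `k` nearest neighbours. Minority rows are always kept.
    """
    majority = Counter(y).most_common(1)[0][0]
    keep = []
    for i, (xi, yi) in enumerate(zip(X, y)):
        if yi != majority:
            keep.append(i)  # never remove minority instances
            continue
        # Brute-force k nearest neighbours of row i, excluding itself.
        neighbours = sorted(
            (math.dist(xi, xj), j) for j, xj in enumerate(X) if j != i
        )[:k]
        foreign = sum(1 for _, j in neighbours if y[j] != majority)
        if foreign < t:
            keep.append(i)  # too few other-class neighbours: keep the row
    return [X[i] for i in keep], [y[i] for i in keep]


# Toy data: five majority rows (label 0), two minority rows (label 1).
X = [[0.0, 0.0], [0.1, 0.0], [0.0, 0.1], [0.1, 0.1],
     [1.0, 1.0], [1.1, 1.0], [0.9, 0.9]]
y = [0, 0, 0, 0, 0, 1, 1]

X_res, y_res = knn_undersample(X, y, k=3, t=1)
print(y_res)
```

In this toy data the majority row `[1.0, 1.0]` sits next to both minority rows, so under the assumed rule it is removed with `t=1` and kept with `t=3`. The brute-force neighbour search is quadratic in the number of rows, which is fine for a sketch but not for large datasets.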

Files in this repository:

KNNUndersampling.java
KNNUndersamplingWeka.java
README.md
beckmann_2015_knn_undersampling_all_datasets.zip
knn_und.R
knn_undersampling.ipynb
pima-indians-diabetes_normalize.csv

Repository: marcelobeckmann/knnund
