Skip to content

Java programs used to clean and parse CSV for Bachelors Final DIssertation at UFPB

Notifications You must be signed in to change notification settings

omar-sr88/ReadCVS

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ReadCVS

Java programs used to clean and parse CSV for Bachelors Final Dissertation at UFPB

Hello,

(Yes, the project is mispelled, but I used it for so long I grew to like it =D )

As the Final Dissertation project for my Computer Science degree I did this analysis on some amount of open data relasead by the Brazilian government regarding the PAC (Programa de Aceleração do Crescimento - Growth Acceleration Program).

The data released on http://dados.gov.br/dataset/obras-do-pac-programa-de-aceleracao-do-crescimento was separeted in many files, each corresponding to a period of about 3 months. These files weren't perfectly formated ( in the .csv format) and therefore needed cleaning.

They were also not cumulative. Eg. if some construcion took 3 cycles to get ready, it would not show up in the 4th cycle file. So there was need for a program to get all items into one file.

The cleanned data was feeded to Tableau and the result was displayed on: http://omarsr.wix.com/dadospac

Data used: https://app.box.com/s/wdzclj2krr8gguipotxqkwcw4t4p4qln

Currently rewriting the parse code in Ruby and propably going to use XML file to skip char errors (which were quite common).

About

Java programs used to clean and parse CSV for Bachelors Final DIssertation at UFPB

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages