This repo contains the data used in the ASU database (DB) courses on Coursera. The data has been parsed to fit the course requirements. The original data comes from the MovieLens website at the University of Minnesota.
ASU offers several courses which focus on different topics in the area of data systems.
ASU provides a free version of the database course. It is available on Coursera: Data in Databases.
Big data analytics tools are increasingly critical for providing meaningful information that supports better business decisions. Big data technologies bring significant cost advantages when it comes to storing and managing large amounts of data. Understanding how to query a database to extract data will empower better analysis of large, complex datasets. Knowledge of indexing mechanisms makes high-speed, selective retrieval of large amounts of information possible.
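As a rough illustration of the point about indexing and selective retrieval, here is a minimal sketch using Python's built-in `sqlite3` module. The `ratings` table, its columns, the index name `idx_ratings_movie`, and the synthetic rows are assumptions made for this example only; they are not the schema or data shipped in this repo. The sketch asks SQLite for its query plan before and after an index is created on the lookup column.

```python
import sqlite3


def query_plan(conn, sql, params=()):
    """Return SQLite's query plan for a statement as a single string."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    # The human-readable plan detail is the last column of each row.
    return " | ".join(row[-1] for row in rows)


# Hypothetical table loosely modeled on a movie-ratings dataset.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE ratings (user_id INTEGER, movie_id INTEGER, rating REAL)"
)
# Synthetic rows: 100 users each rate 50 movies (values are made up).
conn.executemany(
    "INSERT INTO ratings VALUES (?, ?, ?)",
    [(u, m, float((u + m) % 5 + 1)) for u in range(100) for m in range(50)],
)

sql = "SELECT rating FROM ratings WHERE movie_id = ?"

# Without an index, SQLite has to scan every row of the table.
plan_before = query_plan(conn, sql, (7,))

# With an index on movie_id, SQLite can search only the matching entries.
conn.execute("CREATE INDEX idx_ratings_movie ON ratings (movie_id)")
plan_after = query_plan(conn, sql, (7,))

matches = conn.execute(sql, (7,)).fetchall()

print("before:", plan_before)
print("after: ", plan_after)
print("rows returned:", len(matches))
```

The exact wording of the plan text varies between SQLite versions, but the first plan should describe a full table scan and the second a search that uses the index.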
The complete version of this course can be found in the ASU Online Master of Computer Science program on Coursera. The official title of this course is CSE 511 Data Processing at Scale.
This online program is ranked in the Top 10 for Online Graduate Engineering Programs by U.S. News and World Report.
Specific topics covered include:
- Efficient query processing
- Indexing structures
- Distributed database design
- Parallel query execution
- Concurrency control in distributed parallel database systems
- Data management in cloud computing environments
- Data management in Map/Reduce-based systems
- NoSQL database systems
Jia Yu is one of the designers of the ASU DB courses on Coursera.
Jia is a PhD student in the Computer Science department, School of Computing, Informatics, and Decision Systems Engineering (CIDSE), Arizona State University, where he is a member of the Data Systems Lab. Jia’s research focuses on database systems and geospatial data management. In particular, he has worked on distributed data management systems, database indexing, and data visualization. He is the main contributor to several open-source research projects such as GeoSpark, a cluster computing framework for processing big spatial data.