KU-AICC202-Intro-to-Data-Science

Kathmandu University Department of Computer Science

Subject: Introduction to Data Science

Course Code: AICC 202

Level: BTech in AI 2nd year 1st semester

Credit Hours: 3

Type: Core [Theory + Practical]

Course Description

Data Science called as “the sexiest job of the 21st century” by Harvard Business Review., and in this period “the world’s most valuable resource is no longer oil, but data”. This course helps students to understand the basic concepts of Data Science like information extraction from the vast amounts of data using different scientific methods. It explains how the data is manipulated and processed to unravel useful underlying information from the raw and unstructured data. The course follows an example-based approach in terms of providing a better understanding on how data science techniques are applied in the real-world problems. Furthermore, the course provides insights to the ethical use of data, which has become very crucial with the rampant and abundant generation of data.

Programming Language used: Python, Julia, R

Course Objective

Prerequisites

It is expected that students have prior knowledge of mathematical preliminaries such as Probabilities and Statistics, Calculus and Linear Algebra. Besides, students should have the knowledge of high-level programming languages like C and C++ or Python to understand the concepts of Data Science and implement projects.

Course Evaluation

Internal Examination: 50% Final Examination: 50%

Chapters

Chapter 1: Introduction to DataScience [6 Hrs.]

Practical:

Chapter 2: Data Wrangling [9 Hrs.]

Chapter 3: Data to Models: Score, Rank, Class, Clusters [6 Hrs.]

Chapter 4: Visualizing Data [6 Hrs.]

Chapter 5: Data Analysis [12 Hrs.]

Optional

Chapter 6: Big Data [6 Hrs.]

Text Books

  1. Steven S. Skiena. The Data Science Design Manual. Springer Cham, 2017
  2. Cathy O’Neil, Rachel Schutt. Doing Data Science. O’REILLY Media, Inc., 2013

Reference Books:

  1. Joel Grus. Data Science from Scratch 2e: First Principles with Python. O’REILLY, 2019
  2. Ivo D. Dinov. Data Science and Predictive Analytics. Springer, 2018