Pre-requisites for DATA SCIENCE
DS 101 : Computer application and Data Processing
1. Basics of Computer
Operations of a computer, Different units of a computer system like central processing unit, memory unit, arithmetic and logical unit, input unit, output unit etc., Hardware including different types of input, output and peripheral devices, Software, system and application software, number systems, Operating systems, packages and utilities, Low and High level languages, Compiler, Assembler, Memory – RAM, ROM, unit of computer memory (bits, bytes etc.), Network – LAN, WAN, internet, intranet, basics of computer security, virus, antivirus, firewall, spyware, malware etc.
2. Basics of Programming
Algorithm, Flowchart, Data, Information, Database, overview of different programming languages, frontend and backend of a project, variables, control structures, arrays and their usages, functions, modules, loops, conditional statements, exceptions, debugging and related concepts
DS 102 : Computing with C
Basics of C Programming : Components of C language, structure of a C program, Data type, basic data types, Enumerated data types, Derived data types, variable declaration, Local, Global, Parametric variables, Assignment of Variables, Numeric, Character, Real and String constants, Arithmetic, Relation and Logical operators, Assignment operators, Increment and decrement operators, conditional operators, Bitwise operators, Type modifiers and expressions, writing and interpreting expressions, using expressions in statements. Basic input/output. Control statements: conditional statements, if - else, nesting of if - else, else if ladder, switch statements, loops in c, for, while, do - while loops, break, continue, exit ( ), goto and label declarations, One dimensional two dimensional and multidimensional arrays. Storage classes: Automatic variables, External variables, Static variables, Scope and lifetime of declarations. Functions: classification of functions, functions definition and declaration, assessing a function, return statement, parameter passing in functions. Pointers (concept only). Structure: Definition and declaration; structure (initialization) comparison of structure variable; Array of structures : array within structures, structures within structures, passing structures to functions; Unions accessing a union member, union of structure, initialization of a union variable, uses of union. Introduction to linked list, linear linked list, insertion of a node in list, removal of a node from list. Files in C: Defining and opening a file, input – output operation on a file, creating a file, reading a file.
DS 103 : Computing with R
Basics of R Programming : Components of R language, structure of a R program, Data type, basic data types, Statistics Methods and techniques in R
GATE SYLLABUS : DATA Sc. & ARTIFICIAL INTELLIGENCE (DA)
DA 101 : Probability and Statistics
Counting (permutation and combinations), probability axioms, Sample space, events, independent events, mutually exclusive events, marginal, conditional and joint probability, Bayes Theorem, conditional expectation and variance, mean, median, mode and standard deviation, correlation, and covariance, random variables, discrete random variables and probability mass functions, uniform, Bernoulli, binomial distribution, Continuous random variables and probability distribution function, uniform, exponential, Poisson, normal, standard normal, t-distribution, chi-squared distributions, cumulative distribution function, Conditional PDF, Central limit theorem, confidence interval, z-test, t-test, chi-squared test.
DA 102 : Linear Algebra
Vector space, subspaces, linear dependence and independence of vectors, matrices, projection matrix, orthogonal matrix, idempotent matrix, partition matrix and their properties, quadratic forms, systems of linear equations and solutions; Gaussian elimination, eigenvalues and eigenvectors, determinant, rank, nullity, projections, LU decomposition, singular value decomposition.
DA 103 : Calculus and Optimization
Functions of a single variable, limit, continuity and differentiability, Taylor series, maxima and minima, optimization involving a single variable.
DA 104 : Programming, Data Structures and Algorithms
Programming in Python, basic data structures: stacks, queues, linked lists, trees, hash tables; Search algorithms: linear search and binary search, basic sorting algorithms: selection sort, bubble sort and insertion sort; divide and conquer: mergesort, quicksort; introduction to graph theory; basic graph algorithms: traversals and shortest path.
DA 105 : Database Management and Warehousing
ER-model, relational model: relational algebra, tuple calculus, SQL, integrity constraints, normal form, file organization, indexing, data types, data transformation such as normalization, discretization, sampling, compression; data warehouse modelling: schema for multidimensional data models, concept hierarchies, measures: categorization and computations.
DA 106 : Machine Learning (ML)
(i) Supervised Learning: regression and classification problems, simple linear regression, multiple linear regression, ridge regression, logistic regression, k-nearest neighbour, naive Bayes classifier, linear discriminant analysis, support vector machine, decision trees, biasvariance trade-off, cross-validation methodssuch asleave-one-out (LOO) cross-validation, k-folds cross-validation, multi-layer perceptron, feed-forward neural network;
(ii) Unsupervised Learning: clustering algorithms, k-means/k-medoid, hierarchical clustering, top-down, bottom-up: singlelinkage, multiple-linkage, dimensionality reduction, principal component analysis.
DA 107 : ArtificiaI Intelligence (AI)
Search: informed, uninformed, adversarial; logic, propositional, predicate; reasoning under uncertainty topics — conditional independence representation, exact inference through variable elimination, and approximate inference through sampling.