Standard of papers would be that of Post-Graduation Degree Examination of any recognized Institution/ University in India.
Paper - I and Paper – II
DSIM 101 : Probability and Statistics
1. Theory of Probability, Probability Distributions and Sampling Theory • Classical and axiomatic approach of probability and its properties, Bayes Theorem and its application, strong and weak laws of large numbers, characteristic functions, central limit theorem, probability inequalities. • Standard probability distributions - Binomial, Poison, Geometric, Negative binomial, Uniform, Normal, exponential, Logistic, Log-normal, Beta, Gamma, Weibull, Bivariate normal etc. • Exact Sampling distributions - Chi-square, student’s t, F and Z distributions and their applications. Asymptotic sampling distributions and large sample tests, association, and analysis of contingency tables. • Standard sampling methods such as simple random sampling, Stratified random sampling, Systematic sampling, Cluster sampling, Two stage sampling, Probability proportional to size etc. Ratio estimation, Regression estimation, non-sampling errors and problem of non-response.
DSIM 102 : Linear Models and Economic Statistics
• Linear algebra - Vector, matrices, spanning of vector space, matrix algebra, inverse of partitioned matrices, g-inverse, orthogonal matrices, properties of idempotent matrices, characteristic roots and vectors, Cayley-Hamilton theorem, quadratic forms, definite, semidefinite and indefinite forms, simultaneous reduction of two quadratic forms, properties of similar matrices. • Simple linear regression - assumptions, estimation, and inference diagnostic checks; polynomial regression, transformations on Y or X (Box-Cox, square root, log etc.), method of weighted least squares, inverse regression. Multiple regression - Standard Gauss Markov setup, least squares estimation and related properties, regression analysis with correlated observations. Simultaneous estimation of linear parametric functions, Testing of hypotheses; Confidence intervals and regions; Multicollinearity and shrinkage models (ridge regression, LASSO, Elastic Net) model selection criteria, residual diagnostics, categorical data analysis using dummy variables; Outlier detection and treatment. • Definition and construction of index numbers, Standard index numbers; Conversion of chain base index to fixed base and vice-versa; base shifting, splicing and deflating of index numbers; Measurement of economic inequality: Gini's coefficient, Lorenz curves etc. Basics of macroeconomics and national accounts.
DSIM 103 : Statistical Inference and Non-Parametric Test
Estimation, Testing of Hypothesis and Non-Parametric Test Estimation • Concepts of estimation, unbiasedness, sufficiency, consistency, and efficiency. Factorization theorem. Complete statistic, Minimum variance unbiased estimator (MVUE), Rao-Blackwell and Lehmann-Scheffe theorems and their applications. Cramer-Rao inequality. Methods of Estimation • Method of moments, method of maximum likelihood estimation, method of least square, method of minimum Chi-square, basic idea of Bayes estimators. Principles of Test of Significance • Type-I and Type-II errors (False-positive and False-negative errors), critical region, level of significance, size, p value & its interpretation and power, best critical region, most powerful test, uniformly most powerful test, Neyman Pearson theory of testing of hypothesis. Likelihood ratio tests, Tests of goodness of fit. Bartlett's test for homogeneity of variances. Non-Parametric Test • The Kolmogorov-Smirnov test, Sign test, Wilcoxon Signed-rank test, Wilcoxon Rank-Sum test, Mann Whitney U-test, Kruskal-Walls one way ANOVA test, Friedman’s test, Kendall’s Tau coefficient, Spearman’s coefficient of rank correlation. • Distribution of order statistics, distribution fitting, kernel density estimation.
DSIM 104 : Stochastic Processes Poisson process
• Arrival, interarrival and conditional arrival distributions. Non-homogeneous Processes. Law of Rare Events and Poisson Process. Compound Poisson Processes. Markov Chains • Transition probability matrix, Chapman- Kolmogorov equations, Regular chains and Stationary distributions, Periodicity, Limit theorems. Patterns for recurrent events. Brownian Motion - Limit of Random Walk, its defining characteristics, and peculiarities; Martingales.
DSIM 105 :Multivariate Analysis • Multivariate normal distribution and its properties and characterization; Logit-Probit models Mahalanobis’ D2 statistics; linear discriminant analysis (LDA); Canonical correlation analysis, Principal components analysis, Factor analysis, Cluster analysis.
DSIM 106 : Econometrics and Time Series
• General linear model and its extensions, ordinary least squares and generalized least squares estimation and prediction, heteroscedastic disturbances, pure and mixed estimation. Auto correlation, its consequences, and related tests; Theil BLUS procedure, estimation, and prediction; issue of multi-collinearity, its implications, and tools for handling it; Ridge regression. • Linear regression and stochastic regression, instrumental variable regression, panel regression, autoregressive linear regression, distributed lag models, estimation of lags by OLS method. Simultaneous linear equations model and its generalization, identification problem, restrictions on structural parameters, rank and order conditions; different estimation methods for simultaneous equations model, prediction and simultaneous confidence intervals. • Exploratory analysis of time series; Concepts of weak and strong stationarity; AR, MA and ARMA processes and their properties; model identification based on ACF and PACF; model estimation and diagnostic tests; Box- Jenkins models; ARCH/GARCH models. Inference with Non-Stationary Models • ARIMA/SARIMA model, determination of the order of integration, trend stationarity and difference stationary processes, tests of non-stationarity.
DSIM 107 : Optimization and Statistical Computing
• Unconstrained optimization using calculus (Taylor‘s theorem, convex functions, coercive functions). Unconstrained optimization via iterative methods (Newton‘s method, Gradient/ conjugate gradient-based methods, Quasi Newton methods). Constrained optimization (Penalty methods, Lagrange multipliers). Convex sets, Convex hull, Formulation of a Linear Programming Problem, Theorems dealing with vertices of feasible regions and optimality, Graphical solution, Simplex method. • Simulation techniques for various probability models, and resampling methods jack-knife, bootstrap and cross- validation; techniques for robust linear regression, nonlinear and generalized linear regression problem, tree- structured regression and classification; Analysis of incomplete data - EM algorithm, single and multiple imputation; Bayesian modelling and estimation; Markov Chain Monte Carlo and annealing techniques, Gibbs sampling, Metropolis-Hastings algorithm; Neural Networks, Association Rules and learning algorithms.
DSIM 108 : Data Science, Artificial Intelligence and Machine Learning Techniques
• Introduction to supervised and unsupervised pattern classification; unsupervised and reinforcement learning, basics of optimization, model accuracy measures. Linear Regression, Logistic Regression, Penalized Regression, Naïve Bayes, Nearest Neighbor, Decision Tree, Support Vector Machine, Kernel density estimation and kernel discriminant analysis; Classification under a regression framework, neural network, kernel regression and tree and random forests. Hierarchical and non-hierarchical methods: k-means, kmedoids and linkage methods, Cluster validation indices: Dunn index, Gap statistics. Bagging (Random Forest) and Boosting (Adaptive Boosting, Gradient Boosting) techniques; Recurrent Neural Network (RNN); Convolutional Neural Network; Natural Language Processing. Recursive Feature Elimination (RFE), Variance Inflation Factor (VIF), ensembling and stacking methods, Elastic Net regularization, hyperparameter tuning via Grid Search, feature importance interpretation, and cross-validation strategies.
DSIM 109 : Database and Data Warehouse Management
• Data structures; Fundamentals of Relational Database Management Systems (RDBMS) and non-traditional (NoSQL) databases. Principles of database normalization, data redundancy elimination, and consistency maintenance. Structured Query Language (SQL) – querying, updating, aggregating, and managing relational data. Database joins – inner join, left join, right join, outer join – with applications to data merging and integration. Overview of NoSQL databases – document-based, key-value, wide-column stores, and graph databases. Data warehousing concepts, star and snowflake schemas, ETL (Extract, Transform, Load) processes, and OLAP vs OLTP. Database indexing and optimization. Basics of big data frameworks and storage systems for large-scale data handling.