410250: Machine Learning
Credit 03
Unit I Introduction to Machine learning 08 Hours
Classic and adaptive machines, Machine learning matters, Beyond machine learning-deep learning and bio inspired adaptive systems, Machine learning and Big data.
Important Elements of Machine Learning- Data formats, Learnability, Statistical learning approaches, Elements of information theory.
Unit II Feature Selection 08 Hours
Scikit- learn Dataset, Creating training and test sets, managing categorical data, Managing missing features, Data scaling and normalization, Feature selection and Filtering, Principle Component Analysis(PCA)-non negative matrix factorization, Sparse PCA, Kernel PCA. Atom Extraction and Dictionary Learning.
Unit III Regression 08 Hours
Linear regression- Linear models, A bi-dimensional example, Linear Regression and higher dimensionality, Ridge, Lasso and ElasticNet, Robust regression with random sample consensus, Polynomial regression, Isotonic regression,
Logistic regression-Linear classification, Logistic regression, Implementation and Optimizations, Stochastic gradient descendent algorithms, Finding the optimal hyper-parameters through grid search, Classification metric, ROC Curve.
Unit IV Naïve Bayes and Support Vector Machine 08 Hours
Bayes? Theorom, Naïve Bayes? Classifiers, Naïve Bayes in Scikit- learn- Bernoulli Naïve Bayes, Multinomial Naïve Bayes, and Gaussian Naïve Bayes.
Support Vector Machine(SVM)- Linear Support Vector Machines, Scikit- learn implementation- Linear Classification, Kernel based classification, Non- linear Examples. Controlled Support Vector Machines, Support Vector Regression.
Unit V Decision Trees and Ensemble Learning 08 Hours
Decision Trees- Impurity measures, Feature Importance. Decision Tree Classification with Scikit- learn, Ensemble Learning-Random Forest, AdaBoost, Gradient Tree Boosting, Voting Classifier.
Clustering Fundamentals- Basics, K-means: Finding optimal number of clusters, DBSCAN, Spectral Clustering. Evaluation methods based on Ground Truth- Homogeneity, Completeness, Adjusted Rand Index.
Introduction to Meta Classifier: Concepts of Weak and eager learner, Ensemble methods, Bagging, Boosting, Random Forests.
Unit VI Clustering Techniques 08 Hours
Hierarchical Clustering, Expectation maximization clustering, Agglomerative Clustering- Dendrograms, Agglomerative clustering in Scikit- learn, Connectivity Constraints.
Introduction to Recommendation Systems- Naïve User based systems, Content based Systems, Model free collaborative filtering-singular value decomposition, alternating least squares. Fundamentals of Deep Networks-Defining Deep learning, common architectural principles of deep networks, building blocks of deep networks.
Books:
Text:
1. Giuseppe Bonaccorso, “Machine Learning Algorithms”, Packt Publishing Limited, ISBN- 10: 1785889621, ISBN-13: 978-1785889622
2. Josh Patterson, Adam Gibson, “Deep Learning: A Practitioners Approach”, O?REILLY, SPD, ISBN: 978-93-5213-604-9, 2017 Edition 1st
References:
1. Ethem Alpaydin, “ Introduction to Machine Learning”, PHI 2nd Edition-2013, ISBN 978-0- 262-01243-0
2. Peter Flach, “Machine Learning: The Art and Science of Algorithms that Make Sense of Data”, Cambridge University Press, Edition 2012, ISBN-10: 1107422221; ISBN-13: 978- 1107422223
3. Tom Mitchell “Machine Learning” McGraw Hill Publication, ISBN :0070428077 9780070428072
4. Nikhil Buduma, “Fundamentals of Deep Learning”, O?REILLY publication, second edition 2017, ISBN: 1491925612