Thursday 12 June 2014

Brief Description About Modelling Techniques in DS project


There are various statistical and machine learning modelling techniques that can be applied in solving a Data Science Problems. Few of these techniques can be listed as:

  • Classification : Type of Supervised Learning. Dividing items into predefined categories. There are various algorithm available for clustering are Navie Bayes, Decision Tree, Logistic Regression (If a decision boundary/thershold is defined) and Support Vector Machine.  For example predicting if there will be a rain tomorrow or not.                                                                                                                                                   
  • Regression/Scoring:  Type of Supervised Learning .This is task for predicting numeric value or score. For example predicting the rain fall that might occur tomorrow in-terms of inches. Different  algorithms  used for regression analysis are logistic regression (to predict probabilities) and linear regression.
  • Clustering : Type of Unsupervised Learning  This is a task of grouping items into most similar groups.  Different algorithm available for clustering are K-Means, Hierarchical.
  • Recommendations: Producing a list of recommendations in either of way a) based upon user's past experience and on similar experience faced by some other user. b) based on a comparison between the content of items and a user profile. For example Nearest Neighbour Alogrithm.
  • Association Rules: This is about finding relationship among the variables. Finding correlations or reasons behind the effects observed in the data. Algorithm available for association rules mining are APRIORI.

No comments:

Post a Comment