Explore Data Science: Brief Description About Modelling Techniques in DS project

Thursday, 12 June 2014

Brief Description About Modelling Techniques in DS project

There are various statistical and machine learning modelling techniques that can be applied in solving a Data Science Problems. Few of these techniques can be listed as:

Classification : Type of Supervised Learning. Dividing items into predefined categories. There are various algorithm available for clustering are Navie Bayes, Decision Tree, Logistic Regression (If a decision boundary/thershold is defined) and Support Vector Machine. For example predicting if there will be a rain tomorrow or not.

Regression/Scoring: Type of Supervised Learning .This is task for predicting numeric value or score. For example predicting the rain fall that might occur tomorrow in-terms of inches. Different algorithms used for regression analysis are logistic regression (to predict probabilities) and linear regression.

Clustering : Type of Unsupervised Learning This is a task of grouping items into most similar groups. Different algorithm available for clustering are K-Means, Hierarchical.

Recommendations: Producing a list of recommendations in either of way a) based upon user's past experience and on similar experience faced by some other user. b) based on a comparison between the content of items and a user profile. For example Nearest Neighbour Alogrithm.

Association Rules: This is about finding relationship among the variables. Finding correlations or reasons behind the effects observed in the data. Algorithm available for association rules mining are APRIORI.

No comments:

Post a Comment

Subscribe to: Post Comments (Atom)