Big Big Things in my Little Little World: Naive Bayes Text classification

Doc -> {+, -}
Documents are a vector or array of words
Conditional independence assumption: No relation exists between words and they are independent of each other.
Probability of review being positive is equal to probability of each word classified as positive while going through the entire length of document

Unique words- I, loved, the, movie, hated, a, great, poor, acting, good [10 unique words]
Involves 3 steps:
1. Convert docs to feature sets
2. Find probabilities of outcomes
3. Classifying new sentences

Convert docs to feature sets

Attributes: all possible words
Values: no: of times the word occurs in the doc

Find Probabilities of outcomes

Classifying new sentence

A calm and modest life brings more happiness than the pursuit of success combined with constant restlessness.

Big Big Things in my Little Little World

Sunday, 5 November 2017

Naive Bayes Text classification

Convert docs to feature sets

Find Probabilities of outcomes

Classifying new sentence

No comments:

Post a Comment

Blog Archive

Total Pageviews