We will be attempting to classify Yelp Reviews into 1 star or 5 star categories based off the text content in the reviews.
Model is made on yelp dataset. Each observation in this dataset is a review of a particular business by a particular user.
Starting with pre-processeing like Vectorization, Tfidf Transformer etc.
Continue with EDA to explore the data.
Training model with Multinomial Naive Bayes algorithm.
Checking for classification report.
Also Creating a Data Pipeline to store a pipeline of workflow.