I am using an API from WebHose.io to download 100 blogs. To perform data analysis, I have trained the pre-downloaded historical blogs on TF-IDF (Term Frequency- Inverse Document Frequency) and LDA (Latent Dirichlet Allocation) Algorithms to classify and cluster the blogs into groups based on its text (blog content). These algorithms have further been tested on live blogs scrapped by the WebHose API.
Please see the README.md file in final folder to know more.
Thank you.
PrithviKamath/Live-Blog-Analysis
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Topic Modelling on real-time blogs with TF-IDF and LDA algorithms
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published