Completed two additional modules of Text Mining. Started on a project that can accurately predict cuisine type based on input recipe. |
|
This week I have focused on creating more simple projects using R to do some Text Mining analysis. I am able to load a data set of recipes and classify them based on types of cuisine like Mexican, Indian, Chinese etc. This was a step ahead than the simple email classification project last week. Classification of recipes include multi-dimensional analysis as types of cuisines are more than two. I learnt how to extract patterns between ingredients of different recipe and plot them in two dimensional graph using R. That gave a visual representation of what I am doing and allow me to think more analytically.
Besides I completed the next two modules of my Text Mining course which also talked about statistical representation of textual patterns from long texts. My commitment for next week is to design a model that can accurately pattern type of cuisines for any input recipe of ingredients. |
|
Done with first week of Text Mining. Looking forward to post exciting updates in the next week. |
|
This week I finished two introductory courses on Text Mining, which is a crucial application of Machine Learning and Artificial Intelligence. Text Mining helps in parsing texts, filtering out important text and quality information from large documents.
An important application of text application is filtering out of spams from all emails.Text mining helps in ranking relevant information based on search queries. Being a motivated product manager aspirant, I want to understand how to use text mining to serve the society. This week I was working on a project on how to understand if a email is a spam or not based on the contents of the email and the sender email address. I learnt that there are some statistical difference in patterns of data based on word counts between spam and other emails. My commitment for next week is to understand some more in-depth concepts of text mining and how they help in email classification. |
|
3341935334193333378073337805