Skip to Main Content

CSC 476 Text Mining with Python

An introductory course to analyze and process text data. A lot of today's communication is conducted through social media, like Twitter, Facebook, Instagram and WhatsApp, and the majority of this data is in textual form. Due to the unstructured nature of the data, it needs to be processed with Natural Language Processing or Text Mining methods. In this course, students will learn regular expressions and handling text with Python libraries such as Pandas and NLTK. Students will also perform text analysis tasks such as identifying features and classification with Naive Bayes, Support Vector Machines and Long-Short Term Memory Deep Learning Algorithms. This course requires prior knowledge in Python or permission of the instructor. Prerequisite: CSC 302 or CSC 352, with a grade of C+ or better.

Credits

3