← Back to Teaching Course

文本挖掘

Text Mining and Computational Text Analysis

This course introduces text preprocessing, keyword extraction, topic modeling, text classification, sentiment analysis, and LLM-assisted evidence extraction. It is designed for research tasks involving academic literature, policy documents, patents, online communities, and business texts.

TM Graduate students / text analysis beginners Chinese with English terms

Course Introduction

文本挖掘的核心目标,是把非结构化文本转化为可以分析、解释和建模的信息资源。它既可以服务于关键词识别、主题发现和情感分析,也可以进一步支持知识发现、文献综述、政策分析和用户画像构建。

This course connects technical procedures with research design. It pays attention not only to algorithms, but also to how textual evidence can support a convincing academic argument.

Outline

Course Directory

Click a section to open the Markdown-based teaching document.

Module 1

Text Data and Preprocessing

Learn how to transform raw textual materials into analyzable research data.

Module 2

Topic Modeling and Text Classification

Understand classical and neural approaches to extracting semantic structures from texts.

Module 3

Research Design with Text Mining

Apply text mining to literature review, policy analysis, patent intelligence, and online community research.