Department of Czech Language, Faculty of Arts, University of Ostrava

International Conference on Corpus and Computational Linguistics Ostrava 2025, August 20–21

Date: August 20–21, 2025

Location: Room E-204, tř. Čs. legií 150/9, Ostrava


Program on Wednesday (August 20)

International Workshop on Corpus-Based Analysis of Disinformation Texts

This workshop presents ongoing research from the grant project Biography of Fake News with a Touch of AI: Dangerous Phenomenon through the Prism of Modern Human Sciences. The focus will be on Research Work Package 1.1, which examines Czech disinformation texts using contemporary quantitative linguistic methods.

The research utilizes a corpus comprising both disinformation and standard news media texts to identify distinctive features of fake news at the lexical, syntactic, and morphological levels. The aim is to share current findings, showcase methodology, and foster scholarly discussion on the direction of future work.

Time | Speaker(s) | Topic
14:00–14:30 | Miroslav Kubát | Introduction to the Project
14:30–15:00 | Michaela Nogolová | Data: Sources and Corpus Construction
15:00–15:45 | Michal Místecký | Morphological Analysis
15:45–16:30 | Xinying Chen | Syntactic Features of Fake News
16:30–17:15 | Michal Místecký and Michaela Nogolová | Lexical Characteristics
17:15–18:00 | | Break
18:00–21:00 | | Group Discussion

This workshop was supported by the Operational Program Jan Ámos Komenský project ‘Biography of Fake News with a Touch of AI: Dangerous Phenomenon through the Prism of Modern Human Sciences’ (reg. no. CZ.02.01.01/00/23_025/0008724).

Co-funded by the European Union


Program on Thursday (August 21)

Time | Speaker(s) | Topic
08:30–09:00 | Yaqin Wang and Emmerich Kelih | Modelling of rank frequency distribution of core vocabulary in different semantic fields: Preliminary results from Slovene with special attention to loanwords
09:00–09:30 | Bingli Liu and Yiyi Zhao | A Quantitative Study of Subject-predicate-Object Word Class Composition in vernacular Chinese Based on Dependency Grammar
09:30–10:00 | Ján Mačutek, Radek Čech, Michaela Koščová | On the relation between word length and phoneme sonority
10:00–10:30 | Jiří Milička | The Memetic Spread of AI Personas: A Case Study of Bing Sydney
10:30–11:00 | Jianwei Yan | A Power-Law Analysis of Word Length and Frequency in Classical, Modern, and AI Chinese
11:00–11:30 | Xinying Chen | Cross-Linguistic Evidence for Continuous Semantic Representation in Contextual Embeddings
11:30–12:00 | Michaela Nogolová | From Simplification to Authenticity: Syntactic Complexity in Czech Adapted Texts

Book of Abstracts