CS Researcher · BARTA Lab · May 2025 – Present

Narrative Analysis of the July Revolution

Built the largest known dataset of news articles covering Bangladesh's July Revolution and analyzed temporal shifts in media narratives, newspaper stances, and mainstream-vs-social-media coverage.

Python
Web Scraping
Stance Classification
NLP

What this is

A research project examining how media narratives shifted during and after Bangladesh’s July Revolution. The dataset is the largest collection of news articles on the topic. The analysis covers temporal patterns: how language changed week to week, how different newspapers diverged in framing, and where mainstream coverage parted ways with social media.

What’s in the dataset

News articles scraped from major Bangladeshi outlets, normalized and tagged with publication date, source, and metadata sufficient for cross-source comparison. Coverage spans the lead-up, the events themselves, and the aftermath.

What the analysis shows

Three threads I found most interesting:

Stance shifts within the same outlet over time. Specific newspapers changed framing partway through the period, in ways visible in word choice and headline structure. Which means, some newspapers were on the protestor’s side from the beginning, while a few changed their stance when the scales shifted.
Mainstream vs social media gap. Topics that dominated social media coverage often arrived in mainstream reporting later, or framed differently, or not at all. For example, in the second slide, we can see a news story about a celebrity being run on a newspaper website while a major piece of incident related to the protests is completely ignored.
Temporal clustering. Coverage volume around specific events was unevenly distributed across outlets in ways that suggest editorial decisions rather than news flow: some newspapers gave attention to the protest, some totally ignored it.

Why this exists

The July Revolution is recent enough that primary-source narrative analysis still matters for the historical record. Datasets compiled close to events tend to be more honest than ones reconstructed later. I worked on this project because I wanted this archive to exist, and the hypocrisy of our national newspapers to be quantitatively exposed.

The slide deck linked above walks through the methodology, the dataset structure, and the key findings.