Simona Bisiani
Academic and research departments
Surrey Institute for People-Centred Artificial Intelligence (PAI), Faculty of Engineering and Physical Sciences.About
My research project
Large-Scale Content Analyses to Uncover Causes and Effects of Spatial Inequalities in Local News Quality and QuantityTLDR - My research uses a variety of methods, ranging from qualitative (e.g., interviews) to quantitative (e.g., text mining) to dissect the state of local news in the UK. Particularly, I am interested in the spatial distribution of local news content across the UK, and the intersection of local news quality with ownership concentration and democratic participation. I believe my research will support policymakers to better understand which communities are underserved when it comes to local news in the UK, and why it matters.
Abstract - The UK's digital local journalism sector, the largest in terms of local news provision, faces a significant economic crisis. Marked by increased ownership consolidation, the closure of outlets, and a downturn in the quality of news, this crisis poses a serious threat to the availability of varied local news, which is vital for the health of democracy. Despite this crisis’ significance, a significant gap exists in our understanding of the number and distribution of active operational outlets, the communities they cater to, and the quality and extent of the local news content they produce. As a result, there is a lack of empirical evidence of the robustness of today’s local news, undermining the government’s ability to generate meaningful interventions to sustain the local digital press. This study aims to address this gap by examining the crisis's depth, determinants, and implications. The research has five objectives: (1) Identify active local news outlets; (2) Map the spatial distribution of these outlets’ local news content to analyse geographic disparities in coverage; (3) Assess the prevalence of critical and diverse information that is considered fundamental for democratic functioning and pluralism; (4) Examine the effects of ownership consolidation on content quality and quantity; (5) Investigate how content variations affect democratic processes. To achieve this, the thesis adopts a quantitative research design that employs algorithmic text analysis and statistical inference. This research will offer a comprehensive evaluation of the robustness of UK local journalism based on the properties of the news that communities are supplied with. By doing so, the thesis will advance quantitative research on the state of local news provision through the development of various methods for content collection and analysis.
Supervisors
TLDR - My research uses a variety of methods, ranging from qualitative (e.g., interviews) to quantitative (e.g., text mining) to dissect the state of local news in the UK. Particularly, I am interested in the spatial distribution of local news content across the UK, and the intersection of local news quality with ownership concentration and democratic participation. I believe my research will support policymakers to better understand which communities are underserved when it comes to local news in the UK, and why it matters.
Abstract - The UK's digital local journalism sector, the largest in terms of local news provision, faces a significant economic crisis. Marked by increased ownership consolidation, the closure of outlets, and a downturn in the quality of news, this crisis poses a serious threat to the availability of varied local news, which is vital for the health of democracy. Despite this crisis’ significance, a significant gap exists in our understanding of the number and distribution of active operational outlets, the communities they cater to, and the quality and extent of the local news content they produce. As a result, there is a lack of empirical evidence of the robustness of today’s local news, undermining the government’s ability to generate meaningful interventions to sustain the local digital press. This study aims to address this gap by examining the crisis's depth, determinants, and implications. The research has five objectives: (1) Identify active local news outlets; (2) Map the spatial distribution of these outlets’ local news content to analyse geographic disparities in coverage; (3) Assess the prevalence of critical and diverse information that is considered fundamental for democratic functioning and pluralism; (4) Examine the effects of ownership consolidation on content quality and quantity; (5) Investigate how content variations affect democratic processes. To achieve this, the thesis adopts a quantitative research design that employs algorithmic text analysis and statistical inference. This research will offer a comprehensive evaluation of the robustness of UK local journalism based on the properties of the news that communities are supplied with. By doing so, the thesis will advance quantitative research on the state of local news provision through the development of various methods for content collection and analysis.
Publications
This dataset contains several spreadsheets within which four public datasets of print and digital local news outlets in the UK (JICREG, ABC, PINF, and MRC) are triangulated and combined. In addition, the observations from these four datasets have been manually verified to flag obsolete observations. This helped generate a novel, powerful list of print and digital local news outlets (this can be found in sheet "Stage 2 - clean df with enhancements" and includes any observations marked as 1 under the "Baseline" column). The script where manipulation of these datasets occur can be found here: https://github.com/simonabisiani/Local-News-Datasets-Triangulation. The dataset was used to carry out research which resulted in the following journal article: https://www.mdpi.com/2673-5172/4/4/77.
Local journalism is fundamental for a thriving democracy, yet the UK faces a decline in the number of print and digital local news outlets. Large-scale mappings of the surviving outlets offer invaluable insights to policymakers designing interventions to strengthen the sector. Due to the lack of a comprehensive national directory of UK print and digital local news outlets, researchers have resorted to datasets such as circulation auditors’ databases, which have been noted to be incomplete and outdated. A lack of understanding of the magnitude of these data limitations hinders researchers from selecting optimal datasets. This study evaluates four commonly used local news databases, uncovering significant variations in their currentness and comprehensiveness. Thereafter, statistical analyses demonstrate the significant effect of each dataset’s shortcomings on findings in local news research. To address this issue, triangulation and manual verification are employed to create a more comprehensive and robust dataset. This procedure generates a new national dataset of print and digital local news outlets that can be used in future research, alongside a framework for leveraging public data to build an independent research dataset. This work paves the way for more rigorous research in data-driven local news provision studies. Concluding remarks stress the importance of setting definitions and establishing clear data pipelines in an increasingly diversified and dynamic sector.
In the last decade, data journalism has established itself as a thriving field. Recently, Covid-19 has boosted the demand for data-driven reporting to make sense of the pandemic, increasing the importance of studying the evolution of this rapidly evolving and technology-bounded practice. However, the number of efforts to map and systematically measure the data journalism industry are few. This paper analyses the findings of The State of the Data Journalism Survey 2021, currently the most extensive study on the characteristics surrounding the workforce producing and contributing to the data journalism industry. The outcome is an understanding of an expanding workforce with a geographically uneven distribution, which is still homogeneous in terms of tools and educational paths. Self-taught, resourceful, and multi-skilled, data journalists often work in isolation but share pressures of limited resources, time limitations, and access to quality data. The pandemic appears to have directly increased those struggles, although data journalists agree that the field’s reputation has ultimately benefited from it.