FakeNarratives – First Forays in Understanding Narratives of Disinformation in Public and Alternative News Videos

Tseng, Chiao-I; Liebl, Bernhard; Burghardt, Manuel; Bateman, John
https://zenodo.org/records/7715277
Zum TEI/XML Dokument

Introduction: Narratives of Disinformation

Audiovisual media, such as film, TV, webseries, YouTube-videos and so on, have long become one, if not the, dominant cultural communication form of our times (Mirzoeff, 1999; Mitchell, 2001). Their role in shaping sociocultural configurations of all kinds appears incontestable. However, description and analysis of such media presents immense challenges which have so far resisted scalable solutions. Although it is becoming increasingly possible to conduct larger-scale stylistic and formal feature analysis (shot lengths, brightness, color profiles, dynamicity, etc.: cf., e.g., Heftberger, 2018), productive engagement with such media as powerful sociocultural artifacts demands in addition hermeneutic and functional interpretations as pursued in many branches of the humanities. Analyses of these kinds have not so far been possible within the digital humanities.

In an ongoing BMBF project on FakeNarratives1 , we combine expertise, experience and tools crucial for making general advances towards these goals, practically directed at a soundly delimited and, at the same time, socially significant class of audiovisual artifacts: TV news. We address the research hypothesis that news of this kind increasingly employs strategies of audiovisual narrative that may undercut, undermine or construct ideological positions and evaluations of news content independently of what may be simply stated, for example, in accompanying spoken text. The increasing sophistication and technical possibilities available to news channels makes it possible to draw on techniques for storytelling well established in film, but this can by no means always be guaranteed to work as intended. Indeed, filmic storytelling techniques may also serve to more effectively dis-inform.

Although consideration of news reporting as narrative already has a considerably history and is nowadays hardly controversial (Sperry, 1981; Bell, 1999; Liebes, 1994; Langer, 1998; Hickethier, 2000; Dunn, 2005; Machill et al., 2006), methods for engaging specifically with their audiovisual non-verbal properties remain limited. Analyses addressing the audiovisual are overwhelmingly based on small datasets and, as is usual within the humanities, are performed interpretatively. Consequently, just how widespread ‘narrativization’ strategies are, and to what effect and aims they are being applied, urgently demands in depth and larger-scale research from within the digital humanities.

Case study: Tagesschau vs. Bild TV

The current paper presents some early results of the FakeNarratives project illustrating how narrative audiovisual strategies might begin to be addressed at variable scales. Several distinct kinds of narrative strategies have been discussed in the journalism and newsreporting literature, but for the purposes of our first case study we focus on the production of ‘individual-centered narrative’ by means of editing devices such as showing layperson’s talk, individualized events and actions, close- ups of individual faces, and emotion shown either visually or exhibited in accompanying language. These features appear to be widely used across alternative and public news outlets (Vettehen et al., 2008). Our research hypothesis is that both kinds of news outlets use narrative strategies to some degree, but that there may still be significant differences in why and how often they are actually used. We consider it highly implausible that the use of narrative strategies can be reduced to any binary “good vs. bad” or “informing vs. dis-informing” characterisation and so more finely discriminating accounts are in any case necessary. The basic challenge we address here is then to set out how hypotheses concerning narrative construction may be made amenable to larger-scale data analysis.

Here we select some of the most commonly discussed narrative features in news that contribute to highlighting individual stories, a technique that can effectively increase the viewers' memories of news contents and their perception of news severity (Aust & Zillmann, 1996; Zillmann & Brosius, 2012) . We examine whether and how these features are presented in both public and online alternative news channels. In particular, we consider how these features are employed to individualise, personalise or emotionalise people shown in news videos. In other words, we seek to operationalise just what constitutes the narrative strategy of individualisation so that we can explore how the strategy is used across our selected news channels.

Data

Our data include a preliminary selection of 166 news videos, 70 videos from Tagesschau, which we use as an example for more traditional, serious media, and 96 comparable videos from Bild TV, which we use as an example for a more recent, alternative news channel. These videos were all produced between 1 January 2022 and 15 March 2022. The length of entire news programs in both channels is around 15-25 minutes. For current purposes, we analyze the first news report of each news video, namely, the top news story of the entire program, which usually lasts from 3 to 5 minutes. The most reported top news themes from January to the middle of March in 2022 were Covid-19 and the outbreak of the Ukraine war.

Method

For systematically analyzing the news videos, we employed the multimodal discourse approach of film cohesion (Tseng, 2013), which operates by picking out how the deployment of visual image, sound, verbal language, written language, camera movement, framing, color, and many more pattern together so as to introduce and coherently track people, places, objects and events within an unfolding event sequence. In other work, we have found that the cohesion structures resulting from such analyses appear to play significant roles for scaffolding an audience’s perception and understanding of narratives (cf., e.g., Tseng et al., 2021).

Five distinct types of people commonly depicted in news reported were singled out for attention: (1) anchor persons, (2) reporters, (3) news commentators, (4) protagonists in the reported news events and (5) laypersons. The first three traditionally represent the authoritative voice in news and so we track and analyze whether these voices are personalized by including their own attitudes and emotions. The fourth type, protagonists, includes the main involved characters of news events, whereas the fifth type, laypersons, refers to people represented as generic exemplars effected by the news event.

  • Layperson in [talking head] mode: whether a news video employs layperson’s interviews.
  • Specific protagonist: whether a particular individual in the news report is cohesively identified as a specific protagonist.
  • Putin: whether the Russian president Putin is depicted as protagonist co-patterned with particular personalized quality and emotional features.
  • Close-up of laypersons and protagonists: whether laypersons or protagonists are additionally shown with close shots.
  • Self references of anchors, reporters or commentators: whether the ‘authoritative’ voices make recurring uses of self-references, such as the uses of “I”, “my”, “me” suggesting subjective opinions or personal attitudes of the journalists.
  • In addition to these five multimodal categories derived directly from the cohesive chains concerning individuals, we also analyze how these individuals are combined with the following four narratively relevant features:

  • Conflict or violent events: whether any conflict-related event is being mentioned, seen or written in news reports.
  • Negative evaluation: whether particular, recurring quality features with negative evaluation can be found in news reports, for instance, the negative descriptions about the war or politicians.
  • Emotions: whether there are multimodal realization of emotions, for instance, emotional terms mentioned in spoken or written texts and emotional reactions shown in visual images or sounds.
  • Wallpapering: whether, in addition to the individuals and their evaluative features derived from the analysis of cohesive chains, background images that are not strictly related to the contents of the news report running in the foreground are shown.
  • The process of deriving the applicability of these categories for a video segment on the basis of the analysis of cohesive chains is depicted graphically in Figure 1.

    Placeholder
    Illustrative example of cohesive chains (right) developed on the basis of the unfolding video segment (Bild TV news, 25.02.2022, left). The chains show how the identities and elements tracked may occur in any modality and that their combination provides direct support for the narratively relevant features being targeted.

    Initially, we coded and analyzed these nine categories in 166 video segments manually. Each segment was coded as ‘1’ when the category applies and ‘0’ when it does not apply. For simple coding of single news stories, the analyses could be entered into spreadsheets; for more complex coding, either of further narrative strategies or including more of the videos beyond the first news story, more structured annotations are essential, typically performed currently using tools such as ELAN (ELAN, 2021).

    Results and discussion

    The occurrences of the nine categories in our two selected TV channels are displayed in Figure 2. This shows the complex similarities and differences of the two channels when employing the distinct fine-grained strategies contributing to individualization. From the differing distributions among the fine-grained strategies, it becomes clear that, although both channels might be said to be employing individualization as a technique for raising the interest-value of their reports, they do this in differing ways. For example, the occurrences of closeup faces and the talk of laypersons show no significant difference across the two channel (Fisher’s exact test, p=0.101 and p=0.324 respectively); thus both channels can be seen to use the individualizing feature of closeups and laypeople reports. However, the comparison also shows that Tagesschau uses significantly more protagonists to report news events than does Bild TV (Fisher’s exact test, p=0.001). This means that Tagesschau actually individualizes news stories more than Bild TV does, but in a very specific way. Nevertheless, the result in the category of Putin indicates that Bild TV does specifically individualize Putin significantly more than Tagesschau. Indeed, in our data, Bild TV largely labels the Ukraine war or the violent conflict as Putin’s war, while Tagesschau mostly refers the war to the ‘invasion of Russia’ or Russian soldiers. Figure 2 also shows that the categories of conflict events, emotional reaction, negative evaluation and wallpapering all pattern similarly with significantly more use in Bild TV than in Tagesschau. Moreover, the substantial use of self references of reporters in Bild TV indicates the common insertion of personalized attitudes and opinions into their news reports.

    In summary, Tagesschau minimizes Putin’s personification, negative evaluation, decorative editing and reporters’ self-references, presumably in order to balance its heavy use of protagonist- centered narrative model with objective representation and visualization, while the Bild TV news videos employ evaluative and decorative features to dramatize the news events.

    Placeholder
    The percentage occurrences of the nine categories in Tagesschau and Bild TV and corresponding p-values indicating the significance of difference (Fisher’s exact test).

    Apart from the narrative strategy of individualisation, we are currently analysing and formalising other crucial news narrative strategies such as dramatisation and fragmentation (Bennett, 1988). Nevertheless, for enlarging the corpus of news videos and the scale of annotation categories, we require more effective annotation tools beyond the time-intensive manual annotation methods such as ELAN. In this pursuit, automatic annotation is one central goal of the FakeNarratives project.

    What’s ahead: Automatic annotation and exploratory visualization tool

    The results of the previous section are encouraging concerning the use of more abstract audiovisual analyses as a means for revealing differences and similarities between audiovisual materials in ways that naturally relate to issues of narrativization. The characterizations of our selected news media clearly indicate where differences in broader strategies are occurring. It is also nevertheless clear that for more reliable and extensive results, expanding both the kinds of audiovisual materials analyses and the kinds of strategies at issue, it will be essential to move beyond time-intensive manual annotation. Augmenting this work further by the development and application of automatic and semi-automatic annotation is consequently a further central goal of the FakeNarratives project, for which we adopt a two-pronged approach combining foci on visualization and automated analysis.

    We now combine a range of computational techniques for which there are already good solutions available, such as shot detection, face detection, etc., into a flexible tool named Zoetrope, supporting the progressive annotation of larger data sets by means of a more interactive interface (see Figure 3). Zoetrope enhances the existing landscape of tools for audiovisual annotation and analysis (for an overview see Pustu-Iren et al., 2020), as it is not only capable of visualizing automatic annotations, but also provides some basic functionalities for querying the video, for instance for specific key words or named entities. By these means, annotation becomes an activity that can leverage both automatic results of audiovisual processing and more abstract characterizations of data in terms such as cohesive structures as used above.

    The current Zoetrope 1.0 prototype already allows researchers to search for keywords ( segment 1). Any keyword can be used as a query, which will then be searched in the spoken (Mozilla DeepSpeech2  with a German model by Agarwal and Zesch3 ) and written (scene detection framework easyOCR4 ) language of the video. We also embed the query using word embeddings5  so that we can find words semantically related to the query as well. Via a slider, the similarity threshold can be set looser or more exact (=100%). The results of such a query are visualized in segment 2, which basically uses a timeline metaphor. Query results found in spoken language are rendered in green, results in written language are rendered in red. Note that for the query “Krieg” we also find semantically related concepts such as “Truppen”, “Invasion” or “Militär”, due to the semantic similarity threshold, which was set here rather low, at 50%. In segment 3, we see some exemplary automatically determined features, including face detection and audio analysis by means of a spectrogram. Finally, segment 4 shows an interactive video player, which can be navigated by means of the timeline or by clicking on specific results, such as a keyword or a face. Visible features, such as written keywords or faces, can also be rendered with their bounding box withing the video.

    Placeholder
    Zoetrope 1.0 prototype for visualizing and exploring news videos.

    The tools developed within the project will continue to pursue advanced visualization and more extensive automatic analysis. The latter goal is supported by our project partners in Hanover and their continuing development of the web-based audiovisual analytics platform TIBAVA (TIB AV- Analytics).

    Conclusions and Outlook

    We have shown in this paper some of the benefits that may accrue when the analysis of audiovisual media can build on more abstract discourse patterns carried out at scale. We have argued that to take this further, however, a progressive increase in computational support is necessary. To this end, we have set out some first steps taken towards bridging some of the gaps between current technical feature processing possibilities and discourse-related characterizations of data. This is being enabled first and foremost by an interactive tool which already seamlessly incorporates a variety of state of the art processing techniques relevant for more abstract annotation practices. We believe such a tool will greatly speed up the manual annotation that our current corpus analysis relies on and will allow us to discover and annotate many more narrative strategies in the future. Once the tool is beyond its prototype stage, we also plan to release it for the DH community, which might use it to search other video material than news videos, for instance YouTube Let’s Play videos for game studies, or any movie or TV series for film studies scenarios. A live demo of the prototype will be included in the presentation at DHd 2023.


    Fußnoten

    1 https://fakenarratives.github.io/index
    2 https://github.com/mozilla/DeepSpeech
    3 https://github.com/AASHISHAG/deepspeech-german
    4 https://github.com/JaidedAI/EasyOCR
    5 Using spaCy’s de_core_news_md model

    Bibliographie

    • Aust, Charles & Dolf Zillmann. 1996. “Effects of Victim Exemplification in Television News on Viewer Perception of Social Issues”. Journal of Mass Media and Communication Quarterly 4(73), 787-803.
    • Benett, Lance W. 1988. “News: The Politics of Illusions”. New York: Longman.
    • Bell, Allan. 1999. “News stories as narratives.” In A. Jaworski and N. Coupland (Eds.), The Discourse Reader, Chapter 13, 236–251. London: Routledge.
    • Dunn, Anne. 2005. “Television news as narrative.” In H. Fulton, R. Huisman, J. Murphet, and A. Dunn (Eds.), Narrative and Media, 140–152. Cambridge, UK: Cambridge University Press.
    • ELAN. 2021. “ELAN – Linguistic Annotator (Version 6.2).” computer software. Nijmegen: Max Planck Institute for Psycholinguistics. The Language Archive.
    • Heftberger, Adelheit. 2018. “Digital Humanities and Film Studies: Visualising Dziga Vertov’s Work.” Switzerland: Springer Nature.
    • Hickethier, Knut. 2000. “Fernsehnachrichten. Geschichten aus 1001 Nachricht.” Message 2(2), 70–74.
    • Langer, John. 1998. “Tabloid Television: Popular Journalism and the ‘Other News’.” London and New York: Routledge.
    • Liebes, Tamar. 1994. “Narrativization of the News: An Introduction.” Journal of Narrative and Life History 4(1– 2), 1–8.
    • Machill, Marcel, Sebastian Köhler, & Markus Waldhauser. 2006. “Narrative Fernsehnachrichten: Ein Experiment zur Innovation journalistischer Darstellungsformen.” Publizistik 51(4), 479–497.
    • Mirzoeff, Nicholas. 1999. “An introduction to visual culture.” London & New York: Routledge.
    • Mitchell, William. 2001. “What is visual culture?” In Meaning in the visual arts: views from the outside, 202–217. Princeton: Institute for Advanced Study.
    • Pustu-Iren, Kader, Julian Sittel, Roman Mauer, Oksana Bulgakowa & Ralph Ewerth. 2020. "Automated Visual Content Analysis for Film Studies: Current Status and Challenges". Digital Humanities Quarterly, 14(4).
    • Sperry, Sharon L. 1981. “Television News as Narrative.” In R. P. Adler (Ed.), Understanding Television: Essays on Television as a Social and Cultural Force, 295–312. New York: Praeger.
    • Tseng, Chiao-l. 2013. “Cohesion in Film: Tracking Film Elements.” Basingstoke: Palgrave Macmillan.
    • Tseng, Chiao-I., Jochen Laubrock, & John A. Bateman. 2021. “The impact of multimodal cohesion on attention and interpretation in film.” Discourse, Context & Media 44, aop.
    • Vettehen, Paul, Koos Nuijten, & Allerd Peeters. 2008. “Explaining effects of sensationalism on liking of television news stories: The role of emotional arousal.” Communication Research 35, 319–338.
    • Zillmann, Dolf & Hans-Bernd Brosius. 2012. "Exemplification in Communication the influence of Case Reports on the Perception of Issues". Routledge.