Catherine Hancatherinehan@cs.stanford.eduStanford UniversityUSA,Anne Lianne24@stanford.eduStanford UniversityUSA,Deepak Kumarkumarde@ucsd.eduStanford UniversityUSAUniversity of California San DiegoUSAandZakir Durumericzakir@cs.stanford.eduStanford UniversityUSA
(January 2024; April 2024; May 2024)
Abstract.
Social media has become a critical tool for journalists to disseminate their work, engage with their audience, and connect with sources. Unfortunately, journalists also regularly endure significant online harassment on social media platforms, ranging from personal attacks to doxxing to threats of physical harm. In this paper, we seek to understand how to make social media usable for journalists who face constant digital harassment. To begin, we conduct a set of need-finding interviews with Asian American and Pacific Islander journalists to understand where existing platform tools and newsroom resources fall short in adequately protecting journalists, especially those of marginalized identities. We map journalists’ unmet needs to concrete design goals, which we use to build PressProtect, an interface that provides journalists greater agency when engaging with readers on Twitter/X. Through user testing with eight journalists, we evaluate PressProtect and find that participants felt it effectively protected them against harassment and could also generalize to serve other visible and vulnerable groups. We conclude with a discussion of our findings and recommendations for social platforms hoping to build defensive defaults for journalists facing online harassment.
online harassment, online communities, interface design, journalists
††copyright: acmlicensed††journal: PACMHCI††journalyear: 2024††journalvolume: 8††journalnumber: CSCW2††article: 509††publicationmonth: 11††price: 15.00††doi: 10.1145/3687048††ccs: Human-centered computingUser centered design††ccs: Human-centered computingSocial media††ccs: Human-centered computingEmpirical studies in collaborative and social computing††ccs: Security and privacySocial aspects of security and privacy1. Introduction
Journalism has always been a dangerous profession. From 2016 to 2021, the United Nations Educational, Scientific and Cultural Organization (UNESCO) reported that 455 journalists were killed for their work or while on the job(UNESCO, 2021). The online landscape of contemporary journalism has raised further concerns of novel threats to journalists’ safety and expression, including doxxing, swatting, coordinated harassment campaigns, and dogpiling(Posetti etal., 2022; Quodling, 2015).Such online violence disproportionately affects marginalized-identity journalists: in 2020, UNESCO found that 73% of women journalists reported experiencing online violence in the course of their work(Posetti etal., 2022). Likewise, the perceived lack of systemic support from news organizations and social platforms has burdened journalists, especially women and minority-identity journalists, with the responsibility of protecting themselves(Avery E.Holton and Molyneux, 2023; Chen etal., 2020). As a direct result of online violence, the constant vigilance required to monitor it, and the lack of resources to handle it, journalists have reported experiencing fatigue, burnout, or psychological trauma and have expressed a desire to exit the profession altogether(Lee and Park, 2023; Chen etal., 2020).
Journalists are subject to various standards for social media engagement that have made existing online harassment resources difficult or impossible to use. Oftentimes both explicit and implicit, journalistic standards for social media require that journalists consume or even engage with potentially harmful content to do their jobs.Not only do news organizations subject journalists to social media policies that encourage journalists to maintain an “active” and “authentic” online persona(Nelson, 2021), but also journalists’ role in society obliges that they fulfill a social contract to inform the public(Sjøvaag, 2018).Prior work has proposed systems to protect social media users more broadly against online harassment, leveraging community support to curate word or account blocklists(Jhaver etal., 2018, 2022; Party, 2023a; Together, 2023). However, simply filtering content or accounts does not address journalists’ specific needs to monitor the comments they receive and remain accessible to the public. The tension between these professional standards for participation and unmet safety needs in online spaces has left journalists struggling to manage their online presence.
In this paper, we begin by formalizing the specific online safety needs of journalists. We conduct semi-structured interviews with eight Asian American and Pacific Islander (AAPI) journalists to center the needs and concerns of a marginalized journalist group; our approach aligns with prior CSCW work that advocates for centering vulnerable populations to design technological systems that are more robust to abuse, better serving all users(Blackwell etal., 2018). In our interviews, we focus on the nuances of their harassment experiences, protective strategies, and moderation practices. We coupled these interviews with a closed card sorting exercise to investigate journalists’ decision-making processes when interfacing with harassment. We found that, at the time of our interviews in 2022, journalists heavily relied on social media platforms like Twitter/X for work, but these platforms were also the primary vectors for online harassment. Still, journalists expressed a high willingness to interact with users critiquing their stories and an appetite for understanding their audience response, even if their language was negative or passionate. As a result, journalists often manually weigh each of their comments, considering its relevance to their work and how reasonable the commenter appears, to decide how they might engage.Even when faced with abusive interactions, some journalists were hesitant to leverage existing platform moderation mechanisms (e.g., blocking or muting accounts). Because of the affordances of blocking on Twitter/X111Twitter/X indicates to a user when they have been blocked. and the fundamental concept of blocking (denying access to view or interact with a user’s content), journalists felt that blocking would not only empower their attackers as a signal of successful harassment, but also be antithetical to their duty to inform the public.
Building upon our findings, we design and evaluate PressProtect, an interface for journalists to effectively use social media in the face of significant harassment.PressProtect classifies online engagement along two axes: (1) the relevance of a comment to the journalist’s post/story and (2) its toxicity. We use few-shot prompting of a large language model, GPT-3.5, to determine a comment’s relevance to a given story. We combine this with Google Jigsaw’s Perspective toxicity classifier(per, 2023) to determine if a comment is toxic.The PressProtect interface displays non-toxic content by default, stowing “relevant and toxic” and “irrelevant and toxic” content behind additional user interface (UI) controls to give journalists agency in deciding when to engage with potentially harmful content.For instance, while the “relevant and toxic” category may include offensive or inflammatory language, it may also contain information or feedback that is useful for journalists to process. Because PressProtect is a client-side interface that uses content-based features, it does not suffer from the drawbacks of platform-based account blocking that have made this mechanism unusable for some journalists.
To evaluate PressProtect, we conduct a user study with a separate set of eight journalists whose online harassment experiences on Twitter/X span a wide range, from receiving hate comments to being harassed by a stalker or targeted by a nation-state adversary. Through user testing, we explore how they interacted with content displayed through PressProtect’s categorizations, feedback about its logic, and their reflections on incorporating such a system into their workflow.We find that PressProtect’s logical abstraction of both the toxicity and the relevance of comments provided a nuanced lens for viewing reader engagement that effectively protected journalists (especially marginalized-identity journalists or journalists reporting on controversial topics, such as climate change or immigration); journalists also felt that PressProtectcould generalize to serve other populations of users in the public eye (e.g., politicians, celebrities). Finally, through testing, we observe the emergence of a critical need for journalists: effectively flagging imminently dangerous threats to be addressed separately from other kinds of harassment.
We draw upon our observations from the design and evaluation of PressProtect to discuss their implications for social media platform design and opportunities for platforms, newsrooms, and third-party developers to collaborate on better protecting journalists. Our exploration of journalists’ experiences and deficits in existing resources for online harassment illuminate various dimensions to their needs, distinguishing them from those of other at-risk populations. More broadly, we argue that both platforms and researchers consider the multi-dimensionality of needs across different vulnerable populations to design safer online experiences for diverse user populations.
2. Related Work
Extensive literature spanning disciplines, including computer science, communication, and in the case of our work, journalism, has investigated various aspects of online harassment. We build on three primary bodies of related work to inform our design of PressProtect. First, we review the landscape of online harassment experiences for vulnerable communities (e.g., women, content creators, sex workers). Second, drawing upon journalism literature, we discuss the specific concerns of journalists facing online harassment. Finally, we review the different systems and tools that have been proposed to combat online harassment more broadly.
2.1. Understanding community experiences of online harassment
Online harassment is a common experience: as of 2021, 48% of people globally have experienced harassment in some form(Thomas etal., 2021). Significant research has focused on the communities that experience disproportionately severe or frequent online harassment, including gender or religious minorities(Nadim and Fladmoe, 2021; Vitak etal., 2017; Powell etal., 2020; Salehi etal., 2023),politicians(Gorrell etal., 2018; Hua etal., 2020; Krook and Sanín, 2020), content creators(Thomas etal., 2022; Samermit etal., 2023), and sex workers(McDonald etal., 2021).Prior work has investigated the differences in harassment responses by gender, finding that targeted women are more likely than targeted men to manage their online presence by silencing themselves or blocking others(Nadim and Fladmoe, 2021; Vitak etal., 2017).222Similar to prior work on harassment and intimate partner violence, we intentionally avoid the term “victim” to not disempower people facing harassment(Thomas etal., 2021; Kumar etal., 2023; Bellini, 2023). Other research has suggested that politicians preemptively block highly adversarial users by analyzing their profile metadata(Hua etal., 2020) and described how content creators felt that improved content removal mechanisms would effectively protect their personal and community safety(Samermit etal., 2023). Similarly, Salehi etal.examine how visible Muslim Americans (e.g., journalists, activists, aspiring politicians) continue to engage online despite persistent harassment by leveraging protective platform affordances, such as blocking accounts or limiting who can engage with social media posts (e.g., only people you follow can reply to your tweet). However, they also discuss the fundamental limitations and negative externalities of content moderation for marginalized populations — that is, that the harm is systemic and that content removal policies can be abused by attackers to further harass their targets.
Recent work from the CSCW community has taken an intersectional approach to studying online harms and moderation(Gilbert, 2023); prior work supports this direction by highlighting how intersectionality shapes users’ harassment exposure and harm reduction tactics(Wagner, 2022; Krook and Sanín, 2020). Related, Schoenebeck etal.(Schoenebeck etal., 2023) detail how harassment experiences of people in non-majoritarian (i.e., outside of North American and European) countries can help shape platform design that aligns with regional values.We draw upon this body of work to guide our understanding of journalists’ online harassment experiences and inform our approaches to designing a system that caters to their unique online safety concerns.
2.2. Needs of journalists facing online harassment
Journalistic reciprocity is an idea that situates journalists as community-builders that can directly, indirectly, and repeatedly over time, exchange with their readers to develop trust, connectedness, and social capital(Lewis etal., 2014).As social media has become the main medium for reciprocal journalism, online violence against journalists, particularly women, has emerged as a major threat to press freedom(Lorenz, 2023; Lewis etal., 2020). Prior work has highlighted the threats journalists face by virtue of their occupation in an increasingly online world, including coordinated attempts to silence their freedom of expression, manufactured controversies to ruin career opportunities, and calls to action for their physical harm(Holton etal., 2023; Tandoc etal., 2023; Ferrier and Garud-Patkar, 2018; Chen etal., 2020; Lorenz, 2023). Lewis etal.explored what factors lead journalists to face more online harassment, such as gender and “personal visibility,” which is the degree to which personal traits (e.g., one’s face or voice) are presented alongside or as a part of their work(Lewis etal., 2020).However, journalists cannot simply retreat from the Internet. Online journalistic reciprocity has been normalized and often required as a part of their jobs(Chen etal., 2020; Nelson, 2023). Because of this, the disproportion of harassment experiences has placed outsized burden on women journalists to decide if and how to respond to online harassment; while the most common protective action by journalists facing online harassment is to stop engaging, this response further disadvantages marginalized journalists by silencing their voices or hindering their career development(Lewis etal., 2020). Withdrawing from social media could also potentially alienate audiences; some journalists feel that responding to online attacks is essential to maintaining their journalistic integrity; and in some choice cases, journalists have even been fired for non-participation on social media(Nelson, 2023). As such, journalists have a unique relationship with social media where they must endure receiving online harassment because of their employers’ policies. We explore these tensions and concerns through our need-finding interviews in Section3 and incorporate these findings into our interface design.
2.3. Systems and tools to combat online harassment
An expansive area of social computing literature has proposed many approaches for combating online harassment. Goyal etal.introduced a framework to reason about these various protective systems across several axes: Prevention (precautionary measures), Monitoring (understanding harassment activities), Crisis (immediate response needs), and Recovery (mitigating impacts)(Goyal etal., 2022).Within Crisis and Recovery, systems like HeartMob(Blackwell etal., 2018) and Trollbusters(Ferrier and Garud-Patkar, 2018) offer targets a sense of community and support during acute instances of online harassment. Other processes, such as the one proposed by Xiao etal.(Xiao etal., 2022), offer a restorative justice-inspired pathway to enable targets facing interpersonal online harm to make sense of their experiences. Goyal etal.’s own tool, Harassment Manager, seeks to provide journalists, among other users, a streamlined method to document instances of online harassment(Goyal etal., 2022).
Most relevant to our work are both academic research and deployed industry tools that focus on Prevention and Monitoring systems.Squadbox is one such tool that focuses on the application of “friendsourcing” — the idea that harassing messages (in this context, e-mail) can be filtered and later reviewed by a trusted third-party to protect the user from firsthand exposure to harassment(Mahar etal., 2018). Recent work from Jhaver etal.builds on these ideas in FilterBuddy, a system for content creators to create, share, monitor, and delegate the administration of word filters to combat harassment on their YouTube pages. Both Squadbox and FilterBuddy employ participatory design, a technique often used by CSCW and CHI researchers(Ashktorab and Vitak, 2016). In addition to tools proposed by researchers, there have also been a variety of deployed industry solutions for online harassment. These tools range from community-curated account blocklists(Together, 2023) to collaborative systems that use custom machine-learning classifiers(per, 2023) to aid human moderators in identifying abuse(Jigsaw, 2016).
Tools like Block Party provide Twitter/X users with more fine-grained mechanisms for rule-based blocking of accounts, such as if an account has interacted with a problematic tweet or account or if it has a low follower count. These features can also be used to control notifications and the display of content from suspicious or harassing accounts(Party, 2023a); however, the increasing tumult surrounding Twitter/X and the surge in enterprise API pricing have since forced out third-party tools for the platform, like Block Party(Party, 2023b). As we discuss in Section5, we identify components of these systems, such as friendsourcing, that some journalists employ. In Squadbox, participants were frustrated by the lack of agency in deciding if or when they would be exposed to harassing content. Likewise, in FilterBuddy, participants prioritized control so highly that they were willing to trade greater automation for more control over the tool’s options. Further, the protective platform affordances (i.e., blocking and turning off comment access to non-friends on Facebook) that Salehi etal.find are most preferred by users are also rooted in the desire to control who can engage with their accounts. Based on these findings, it is resoundingly clear that users want control over their online experiences — both in how they encounter harmful content and in how they prefer to use protective tools. We explore the extent to which journalists are able to benefit from this suite of existing tools and use these observations in a participatory design process to develop a system that better addresses journalists’ specific needs.
3. Need-finding Interviews
To mindfully design an anti-harassment tool that meets the unique needs of journalists, we conducted semi-structured need-finding interviews with eight journalists (J1-J8) that self-identified as having experienced online harassment.Through these interviews, we aimed to better understand journalists’ experiences with and current strategies for handling online harassment. We also use the open-ended nature of these interviews to explore deficits in existing mitigation strategies or tools and how they might be addressed. We distill these observations into a set of needs (Section3.2.4) that inform our interface design for a re-imagined and improved social media experience for journalists.
3.1. Study Design
We recruited participants from the Asian American Journalists Association (AAJA) convention in July 2022.Following the suggestions of prior work in the HCI community(Ashktorab and Vitak, 2016; Schoenebeck etal., 2023; Scheuerman etal., 2018; Blackwell etal., 2017; Han etal., 2023), we recruited from an organization that serves a minority identity in the United States (i.e., Asian American and Pacific Islander). By centering marginalized-identity journalists’ experiences, we use a “bottom-up” approach as suggested by Blackwell etal.to better design protections for all users(Blackwell etal., 2017).We recruited using two primary methods: (1) we tweeted using the hashtag for the conference attendees, #AAJA22, and (2) we posted in the conference Slack for all attendees. In both methods, we asked journalists to contact us (via e-mail or Twitter/X direct message) if they have had personal experiences with online harassment. We scheduled either in-person or video interviews with all of the journalists that reached out to us.Our recruitment efforts yielded eight participants, and while we acknowledge the limitations of our sample size (Section6.3), we concluded our need-finding recruitment when we felt we had reached adequate saturation(Saunders etal., 2018) — that is, new themes did not emerge from interviews with additional participants — to reason about journalists’ harassment experiences and protective behaviors.Although we did not collect other identity markers like sexuality or age, we collected details on their years of experience in journalism and the kinds of stories they cover.Participants came from a wide range of professional backgrounds, from independent journalists to those in major newsrooms, and their coverage spanned various beats, or specialized reporting for a particular topic (e.g., politics, breaking news, sports); we report more detailed participant information in Table1.We formulated our interview questions to explore preliminary topics of interest, including how journalists use social media platforms professionally, what forms of online harassment they have experienced and how those experiences impacted them, and what protections they use for online safety and how they seek such resources. As participants spoke about their experiences, we prompted them to provide specific examples of incidents and asked follow-up questions to better understand the motivations of their actions or behaviors.The full list of primary questions is included in AppendixA.
Demographic | Group | N (out of 8) |
Beat | Local News | 2 |
Climate | 1 | |
Investigative Reporting | 1 | |
Breaking News | 1 | |
Culture | 1 | |
International Relations | 1 | |
College Football | 1 | |
Years of Experience | 1-3 | 4 |
4-6 | 2 | |
7-9 | 2 | |
Social Media Platforms | Twitter/X | 8 |
Used for Reporting | 4 | |
3 | ||
1 | ||
Gender | Woman | 7 |
Man | 1 |
3.1.1. Mention Sorting Exercise
To supplement our semi-structured interviews, we incorporated a closed card sorting exercise with each of the eight journalists we spoke to. The cards for this exercise were categories for how the journalist would prefer to interact with online reader engagement. We began this exercise only after the interview portion to avoid restricting participants’ frame of mind during open-ended exploration. In this exercise, we curated a set of 10 abusive Twitter/X mentions that were previously sent to a prominent journalist on Twitter/X. We chose this set of mentions to span various forms of harassment, including personal attacks, critical article commentary, and sexually explicit content, to understand similarities or differences between how journalists process each of them. For every mention, we asked journalists to “talk out loud” as they sorted it and probed for more details when relevant to more concretely understand journalists’ decision-making process for interacting with or moderating abusive engagement. The category cards were as follows:
- (1)
The tweet is displayed as normal; no moderation action is taken.
- (2)
The tweet is moved to a separate area before the journalist views it, as if in a spam folder.
- (3)
The tweet is removed from view by default; the journalist prefers to never have seen it.
Interviews lasted between 20minutes and an hour, with an average length of 35 minutes, depending on the degree of participants’ exposure to harassment and their willingness to elaborate on their experiences. Participants were compensated with an Amazon gift code worth $30.
After conducting the interviews, two researchers independently coded the transcripts into higher-level themes, such as “social media platforms” and “mitigation strategy” (Table2) using thematic analysis. The initial set of category labels were guided by the primary topics explored in the interview questions (AppendixA), but labels were iteratively added to the codebook when interview excerpts did not fit existing labels. Although we do not use these labels as a basis for describing our results quantitatively, we use inter-rater reliability (IRR) as a process to reduce confirmation bias, grounded in McDonald et al.’s guidelines for using IRR(McDonald etal., 2019). Inter-rater agreement for this codebook was high, with a Kupper-Hafner agreement metric(Kupper and Hafner, 1989) of 0.83. Both researchers met to resolve conflicts and obtain a final, coded interview dataset.
Code | Meaning |
---|---|
Harassment trigger | An event or action that triggers a flood of harassment from one or many users |
Social media platforms | Discussion on what social media platforms they use and in what manner for their work |
Tolerance threshold | Factors behind the decision-making of whether or not to engage with users or moderate their content (e.g., severity, violent threat) |
Harassment vector | What vectors attackers use to harass journalists (e.g., e-mail, comments on social media platforms) |
Kinds of hate | Type of hate that was operationalized in an attack against them (e.g., racism, sexism, xenophobia) |
Mitigation strategy | Current strategies employed to combat harassment |
Filtering & moderating concern | Concern related to the effect or implication of filtering user content from their view or moderating users or their content |
Resource-seeking strategy | Current strategies for seeking resources or community support for online harassment |
3.1.2. Ethical Considerations
We interviewed journalists from minority communities about potentially triggering experiences with online hate and harassment. Because of the sensitive nature of this material, we fully informed participants of the purpose of the research and the kinds of questions we planned to ask before beginning the interviews. We received verbal consent before beginning the interview and reminded participants that they could decline to answer any questions or stop the interview at their discretion. We removed any potentially personally identifiable information (PII) from any quotes we included in this paper. This research was approved by our Institutional Review Board (IRB).
3.2. Interview Findings
In this section, we present the results from our exploration of how journalists experience online harassment, what journalistic norms govern their online interactions, and what strategies journalists employ to protect themselves. We synthesize these findings into a set of currently unmet safety needs and use this to guide the design of a protective interface that helps restore agency to journalists in the face of significant harassment (Section4).
3.2.1. Characterizing journalists’ online harassment experiences
Various threat models for online harassment
We begin by investigating how journalists experience online harassment. Six participants described attacks on their journalistic credibility or integrity as a major form of harassment. Consistent with intersectionality theory(Crenshaw, 1991; Collins and Bilge, 2016), all of our participants’ harassment experiences as journalists were shaped by various facets of their identity, including their race, gender, sexuality, and nationality. For instance, J5 recounted an experience where they were accused of conspiring with a foreign government agenda because of their foreign nationality as a journalist reporting on international politics in the United States.We next sought to understand what might trigger harassment. Seven of the eight journalists pointed to the release of a new story as the main entry point for online attacks, but the attacks they experienced varied significantly in burstiness and duration. Several journalists described attack patterns where they would receive a sudden flood of emails or replies on Twitter/X in response to the release of a story. Others dealt with persistent attackers that continued to harass them long after the initial story of interest, tracking the journalists’ various accounts and emails even as they moved to different newsrooms during the course of their career.When asked which vectors were most commonly used for harassment, over half of our participants mentioned Twitter/X; other platforms included more direct, private forms of communication, including WhatsApp messages, Facebook messages, text messages, and e-mails. Four journalists pointed to replies to their tweets as the most common vector for harassment.Two journalists experienced more coordinated, cross-platform harassment campaigns (J3, J7). J3 recounted that a “handful of ringleaders” directed users on one platform (Reddit) to execute attacks across their accounts on other platforms:
“I experience harassment on Twitter, Instagram, and Reddit, but I think I miss a lot of what happens on Reddit because I choose not to look at it. A lot of it happens on Asian American, misogynist subreddits, where…they talk about me by name with some frequency. Sometimes, I think people that are either active on Reddit or Twitter would find [my personal Instagram] and tag me in hateful posts or comments that are quite abusive.” – J3
The characteristics of these more complex attacks, such as their motivation, coordination, and execution across various platforms, parallel the experiences of other at-risk communities. For instance, Jhaver et al. examined how brigading, a coordinated attack by one online group on another, often occurs on Reddit and Twitter/X due to opposing ideologies and regularly targets marginalized groups (e.g., women, abuse victims)(Jhaver etal., 2018). Prior CSCW research has also investigated the properties of hate raids, which are attacks that flood a target’s chatroom with hateful messages, on the livestreaming platform Twitch(Han etal., 2023; Cai etal., 2023). In more severe cases, these hate raids can also be coordinated on or spread to other platforms. Furthermore, both J3 and hate-raided streamers felt that their marginalized identities (e.g., gender, race, sexuality) motivated their attackers to select them as targets. Therefore, mirroring the characteristics of previously studied attacks on other platforms, the threat model described by J3 and J7 situates itself within what Marwick describes as morally motivated networked harassment: that is, harassment that leverages the amplification of a cross-platform, networked audience “justified” by the target’s violation of identity norms(Marwick, 2021).
Extending beyond psychological harms
In addition to experiencing cross-platform, coordinated attacks, half of our participants detailed experiences where online harassment escalated to offline harms, which have been been well-documented in the literature(Lewis etal., 2020; Holton etal., 2023; Tandoc etal., 2023; Ferrier and Garud-Patkar, 2018; Chen etal., 2020; Lorenz, 2023). These harms threatened participants’ physical safety through doxxing and violent language, which two participants described as sexually violent in nature. One journalist discussed how a well-resourced and motivated adversary, such as a nation-state, can deeply complicate these threats, even endangering the physical or financial safety of their loved ones.Another journalist (J7) detailed how they became a target of harassment by someone who was initially a source for a story. J7 explained that they were accused of racism because of a miscommunication with a representative of an organization they were interviewing, which led to a coordinated attempt by that organization to harm J7’s reputation. These public accusations were conducted across several of J7’s social media accounts — both personal and professional — and cost J7 a job opportunity.
Informed by prior work, we find that our participants’ concerns over these threats are shared by other vulnerable populations online, such as content creators(Thomas etal., 2022) and visible Muslim Americans(Salehi etal., 2023). All of these populationsacknowledge that while threats to their physical and financial safety occur less frequently than other forms of harassment, the impact of such threats makes them an utmost concern, especially for women within these groups.
3.2.2. Understanding journalistic norms centering online engagement
Despite online harassment and its resulting harms, journalists cannot withdraw altogether from social media. Aside from the importance of press freedom, our interviews resoundingly confirmed that social media platforms are an integral part of journalists’ professional workflows. In fact, the platforms that journalists listed as primary vectors of harassment were also the platforms they used most for work; all of the journalists we interviewed stated that Twitter/X was the primary and most important social media platform for their profession.While prior work has found that public-facing platforms like Twitter/X are often more hostile ecosystems for journalists than more private platforms like Facebook(Lewis etal., 2020), we find that the public nature of Twitter/X makes it most central to journalists.Our participants detailed various professional uses for social media, including keeping up to date with breaking news, reaching out to sources, and promoting their work. J6 described this “need to have a social media presence” as a double-edged sword that opened a communication channel between journalists, sources, and readers, but ultimately left journalists vulnerable.
Many of our participants felt they needed to actively engage in online discourse. According to their responses in the mention sorting exercise (Section3.1.1), two valuable aspects of engaging with their online audience were (1) understanding what opinions people have and why, and (2) refuting unverified or incorrect information in the comments. For the latter, two of our participants explained that they felt responsible as journalists for curating comments on their social media posts so that readers consume accurate information.Our participants also acknowledged that journalistic norms require that they appear receptive to criticism. Three journalists noted that they hesitate to block attackers or hide problematic replies, as both of these are publicly visible actions on the Twitter/X platform. As such, journalists wishing to use these features face potential backlash: accusations of censoring free speech or demonstrating poor journalistic professionalism. Several journalists elaborated on how silencing themselves or the dissenting opinions of others on social media was fundamentally at odds with their perceived role as public informants:
“If people want to criticize me, it still is in the public interest for people to be able to see my work. People should be able to follow me and look at my work and look at my tweets, because I’m a journalist, and I serve the public.” – J2
“[As a journalist] you do want to be receptive to feedback, because you are doing this to inform an audience. You are not in an echo chamber. I am not a blogger.” – J4
We contextualize these observations through the concept of reciprocal journalism. Lewis etal.argue that reciprocity, or the principal of mutual exchange within a community, applied to journalism serves as a mechanism for journalists to re-imagine a journalist-audience relationship that positions journalists as community-builders. While Lewis et al. acknowledge the possible harms of social media exchanges, such as revenge or trolling(Lewis etal., 2014), other research has found that journalists experience cognitive dissonance from the harassment they receive when engaging in reciprocal journalism; despite this cognitive dissonance, researchers found that organizational and personal influences that encourage reciprocity were too strong for any of the journalists they studied to stop reciprocating altogether(Deavours etal., 2023). Consistent with these results, we find that journalists are subject to professional expectations that make the passive consumption of, active engagement with, and curation of social media content central to their work, ultimately requiring journalists’ participation in online spaces.
3.2.3. Mitigation strategies
Limited use of third-party & platform moderation tools
Although all of our participants described needing online protections, they largely did not rely on third-party tools to defend against online harassment. Only two journalists mentioned such tools, naming Block Party (an anti-harassment tool for Twitter/X that facilitates bulk blocking accounts)(Party, 2023b) and Circleboom (a Twitter/X management tool that provides analytics and tweet deletion services).We found that journalists’ limited usage of third-party safety tools is due to the “lack of awareness of any new resources and third party tools” (J7) and the functional limitations of existing tools, which we detail later in this section.Although J7 did not use third-party tools themself, they felt that third-party tools had the potential to democratize protections against online harassment for local newsrooms and independent journalists, who do not have access to the same resources as larger newsrooms.One participant described bespoke security and safety filters for blocking phone calls offered by their large newsroom, but none of our participants mentioned their organizations when describing how they learn about new resources or seek support in the face of harassment.
As far as first-party safety and moderation mechanisms offered by social media platforms, five journalists mentioned using the blocking or muting features on Twitter/X. However, three of these five noted that blocking is not a mechanism that they often leverage. One journalist explained that they would only consider escalating to blocking or muting if the attacker was persistent.In the mention sorting exercise, we found that journalists’ preferences for removing content greatly varied. However, the type of inflammatory comment that seven of the eight journalists opted to remove was sexually explicit language directed toward a journalist; two of these seven journalists explained that this was because the sexual comment bordered on physical threat. We also observed that if the comment inquired for the journalist’s PII, threatened their physical safety, demonstrated a pattern of “bad behavior” from an account, contained sexually explicit language, or included unverified or incorrect information, journalists were more likely to take actions such as blocking, reporting, and muting.
Pitfalls of Account Blocking
We next investigate our participants’ general apprehension of blocking, one of the main protective mechanisms provided by social media platforms. We find that some journalists believe that blocking offending accounts is too costly in terms of both time and mental well-being. J2 elaborates that because of these costs and how they typically receive harassment, blocking was not a viable option for them:
“[Blocking is] just not worth the effort. A lot of the times, I feel like the harassment is more than likely coming from a different person, or at least different accounts, each time. So, I don’t think blocking someone will do much.” – J2
Block Party attempts to reduce the cost of blocking by allowing its users to block others in bulk based on account attributes, such as number of followers, account age, profile description, or engagement history with tweets of interest (e.g., blocking all users that retweeted or liked a tweet). However, even bulk blocking raised concerns for our participants: two participants worried that blocking accounts with few followers could obstruct constructive dialogue or even access to valuable sources. J3 described the frustrations that they had when using Block Party:
“I have some settings in [Block Party] about the types of account I don’t want to hear from, but I think they do end up catching ‘friendlies’ that are simply smaller accounts. I find that frustrating because I was a small account, and I want to hear from people that I am not fighting with.” – J3
As such, J3 felt that existing third-party tools did not cater to journalists’ specific needs, stating that if they were to use a tool, its design must show that “it understands the actual experiences that people who get harassed go through and address their concerns and fears,” rather than forcing them to “shoehorn” their situation into its protective mechanisms.
Furthermore, while blocking can shut down persistent behavior from a specified account, it still fails to protect against the use of “sockpuppet” accounts — accounts that are a part of a set of many accounts controlled by a single “puppetmaster”(Kumar etal., 2017), or a distributed attack model spanning numerous legitimate accounts. Beyond the technical efficacy of blocking, our participants were also wary of its negative externalities. For instance, two journalists described a threat model in which they received harassment from a colleague; however, they felt that the complexities of their social and professional networks, paired with the public visibility of blocking and muting on Twitter/X, made using these features too detrimental to their professional development. Additionally, J8 described how blocking could give attackers validation that they harmed their targets:
“It feels like you are giving something up when you choose to block somebody, and they can see that they got under your skin. I would worry that it would cause other people to [harass you] too.” – J8
As such, J8 felt that blocking their attackers could actually encourage them and trigger further harassment, creating a positive feedback loop. While not described by our participants, prior work(Salehi etal., 2023) also discusses the weaponization of protective platform affordances, such as the mass reporting of target accounts or content as a silencing tactic by attackers, which has restricted vulnerable populations usage of such tools.
Raising thresholds for social media exchanges
Due to the shortcomings of existing protection mechanisms in the face of reciprocal journalism norms, half of our participants stated that they typically try to ignore online harassment rather than actively moderate it.Across the board, we found that journalists have raised their thresholds for actively engaging with their audience using various criteria, such as the account attributes of a commenter or the perceived intention or usefulness of a comment, to inform their decisions. For example, one factor that would encourage journalists to actively exchange with a reader was if their account appeared legitimate (e.g., they had a profile picture and username that seemed to correspond with their real identity, they had a reasonable number of followers, they had a history of platform usage). Likewise, three journalists described that they were more willing to interact with their audience if the exchange was a critique of their work. On the other hand, we found that journalists also justified ignoring or dismissing comments if they felt they were “not to be taken seriously” or the language undermined the commenter’s own credibility (e.g., the commenter is perceived as irrational or unintellectual when they use baseless ad hominems). These observations echo prior work that has documented journalists’ increasingly negative perception of online audience interactions(Lewis etal., 2020). In the closed card sorting exercise, journalists generally did not opt to remove such irrelevant or nonsensical comments, instead preferring to passively view (and dismiss) them. Particularly, two journalists specified that they would still want to see even offensive or rude comments that engaged with the topics in their story, even if they were not insightful or polite. Six of the eight journalists opted to defer reading, or separate as if in a spam folder, at least one of the inflammatory comments. These journalists preferred deferring the view of inflammatory comments to removing them when they felt that viewing such a comment would allow them to better understand their audience or monitor patterns of harassment from repeat attackers for escalation potential.
3.2.4. Summarizing needs
Above, we discussed the various threat models of online harassment that journalists face, the professional norms that dictate how journalists operate in online spaces, and the mitigation strategies that journalists employ (Section3.2.3). Based upon these findings, we argue that the tension between the expectations of reciprocal journalism and journalists’ desire to protect themselves from harassment raise online safety needs that are unaddressed by current mitigation strategies.Grounded in our interviews, we summarize the following outstanding needs of journalists when engaging with online audiences:
- •
Journalists need to use social media to contact sources, receive tips, and gauge reader feedback.
- •
Journalists need to actively participate on social media platforms to engage in reciprocal journalism and adhere to professional norms.
- •
Journalists need protection from several different types of online attacks, including one-time commenters, persistent abusers, and large-scale networked harassment.
- •
Journalists need protections that do not validate the efforts of attackers or trigger further harassment.
These needs reflect that while journalists must actively use social media, they are not equipped with the tools or resources to do so safely.As we described in Section3.2.3, news organizations do not adequately provide resources or support to journalists facing online harassment. This gap has left journalists to turn to third-party tools or platform-based levers to protect themselves. However, these protections ultimately fail to resolve the tension manufactured by journalists’ need to cultivate bidirectional journalist-audience relationships. We argue that these deficits make existing protection mechanisms difficult, undesirable, or impossible for journalists to use effectively.
4. PressProtect: Navigating Social Media in the Face of Harassment
Derived from the unmet needs that we identified in Section3.2.4, we synthesize how we can make social media safer and more usable for journalists in the face of online harassment. In this section, we begin by defining design goals for such an interface. We then use these goals to guide our design of PressProtect, which provides journalists greater control over engaging with readers on Twitter/X by adding UI controls for the display of different types of reader replies.
4.1. Design Goals
We map the needs in Section 3.2.4 to the following concrete design goals (G1–G6):
- (G1)
Journalists should be able to seamlessly access helpful and harmless reader responses while protecting themselves from harmful reader responses.
- (G2)
Journalists should still be able to interact with harmful responses that are useful for their work, without simultaneously encountering harmful and unhelpful responses.
- (G3)
Journalists should be able to access and understand the full scope of reader responses to their articles, including those that are harmful, if they so choose.
- (G4)
Journalists should be able to use these protections even in the face of large-scale networked harassment like dogpiling333On social media, dogpiling is when someone is targeted by large groups (Marsden, 2017). or adversaries that leverage bots or sockpuppets.
- (G5)
Journalists should be able to use protections without the fear of silencing good-faith users.
- (G6)
Journalists should be able to use protections without the fear of affirming attackers’ attempts to harass them or appearing to censor free speech or ignore feedback.
4.2. From Goals to Interface
Guided by these design goals, we introduce PressProtect, an interface for Twitter/X intended to make the platform more usable for journalists. We chose to develop our interface for Twitter/X, because based on our need-finding interviews, it is the most central platform to journalists’ work. Furthermore, we focused on replies to journalists’ stories that are posted on Twitter/X because journalists identified replies as a primary vector for harassment. Motivated by journalists’ need to use social media to gauge reader feedback and engage in reciprocal journalism (Section3.2), PressProtect allows journalists to decide when to consume or interact with different types of reader replies.Based on G1–5, it categorizes comments using content-based attributes (i.e., how harmful and helpful a comment might be) to proxy journalists’ process for reasoning about reader engagement.Using these categorizations, PressProtect adds UI controls for content that could be harmful, guided by G1 and G2.
This content-based abstraction for reasoning about reader replies also naturally handles threat models that involve many accounts, such as dogpiles and sockpuppets, because it does not rely on account attributes (G4). Similarly, this content-based approach satisfies G5 because it does not deny smaller or new accounts access to engaging. However, grounded in G3, the interface does not dispose of any content, even if it is harmful: journalists can still choose to view all comments. Finally, the protections offered by the interface are fully client-sided, as required by G6. Commenters, even abusive ones, are not aware of the journalist’s use of PressProtect or how their comments are categorized, and they are not denied access to engaging with the journalist or their work. Below, we discuss the implementation of PressProtect’s two underlying components: how content is categorized and how these categorizations are presented to the journalist.
4.3. Content Categorization
Accomplishing G1 and G2 hinges on emulating journalists’ processes for reasoning about how harmful and helpful reader responses are.To do this, we create working definitions of toxic and relevant content, grounded in the results of our need-finding interviews (Section3.2.3). We define toxic in accordance with Google Jigsaw’s Perspective, an API that classifies comments by toxicity using machine learning, which states the following: “We define toxicity as a rude, disrespectful, or unreasonable comment that is likely to make someone leave a discussion.”444https://perspectiveapi.com/ This definition of toxicity maps directly to content attributes that shape journalists’ willingness to engage with reader comments, such as if the comment appears irrational. Because journalists derive utility from comments that engage with content or topics in their stories and are more likely to engage with such comments, PressProtect uses how relevant a comment is to their stories as a proxy for how useful it is. Therefore, we define a relevant comment as one one that addresses the content of the news article linked in the journalist’s tweet, as determined through a GPT-3.5 prompting process that provides both the article and comment text.
Using our toxicity and relevance classifications, we bucket comments into four categories (C1–C4), as shown in Figure1:
- •
C1: Relevant and non-toxic — replies that may be useful to the journalist with minimal harm. These might include productive discussions on the quality of the reporting or reference future directions, tips, and sources.
- •
C2: Irrelevant and non-toxic — replies that may be less useful to the journalist, but are likely harmless. For example, they could be lighthearted or tangential remarks.
- •
C3: Relevant and toxic — replies that are possibly harmful, but could contain information that is useful to the journalist. For example, they might present commentary on something the journalist missed but be couched in offensive language.
- •
C4: Irrelevant and toxic — replies that are likely harmful and not useful to the journalist. For example, they may be hate comments with little substance.
We provide examples of comments that a journalist might encounter in each of these categories in Table3. We paraphrase these example comments from real comments in participants’ data to preserve anonymity.
Not Toxic | Toxic | |
---|---|---|
Relevant | “I wouldn’t have ranked X over Y, but I can see why you did.” | “They did terrible things, yet you wrote a glowing story. What a disgrace.” |
“Thanks for your article, but you used the wrong term for this illness.” | “This is such a stupid ruling.” | |
Irrelev. | “Geez” | “No one gives a ****, this is 2023.” |
“@USERNAME” | “Astrology is just as fake as climate change.” |
4.3.1. Toxicity
To tag tweet replies that may contain harassment, we use Google Jigsaw’s Perspective API(per, 2023), which is a state-of-the-art toxicity detection system that has been used extensively in prior work(Kumar etal., 2023; Hua etal., 2020; Saveski etal., 2021; Xia etal., 2020).For the purposes of our tool, we use Perspective’s TOXICITY classifier with a threshold of to identify if a comment is toxic. We use this threshold, as prior work observed this thresholdbalanced precision and recall well for Twitter/X content(Saveski etal., 2021). We confirm this with our own thresholding experiment using a dataset from Kumar et.al(Kumar etal., 2021), which we detail in AppendixB.
4.3.2. Relevance
To determine whether a tweet reply is relevant to a given article, we use few-shot prompting of OpenAI’s GPT-3.5, an off-the-shelf large language model (LLM). To evaluate whether this approach fit our task well, we first built a ground truth corpus of relevant tweet replies for a set of published articles. To do this, we collected 1,787tweets by 1,439journalists from eight different newsrooms: The Wall Street Journal, AP News, Financial Times, The Guardian, Bloomberg News, The New York Times, The Washington Post, and USA Today. We only considered tweets that contained a link to a published news story from each outlet. We scraped the content of each news article and processed each one using the Newspaper3k library,555https://newspaper.readthedocs.io/en/latest/ which provided us both the title of the article and a sanitized version of the article text. We then collected each tweet reply to the original tweet containing the link. In total, we amassed 3,973 tweetreplies. Two independent researchers then coded a sample of 300tweet replies after reading the article text to establish a ground truth test set for tweet reply relevance to news articles. There was high agreement between the raters, achieving a Cohen’s Kappa score of 0.8. The raters then resolved differences and ultimately aligned on a final “ground truth” corpus.
We attempted three strategies to identify tweet relevance: keyword matching based on article title, topic-matching using Latent Dirichlet Allocation (LDA), and few-shot prompting of GPT-3.5. For the LLM evaluation, we use the following prompt:
The following is the text of a news article:<article_text>.Consider the following comment:<comment_text>Return a JSON object with a field, "relevance," that is ascore from 1 to 5 depending on how relevant the commentis to the article.
We then provide three examples of relevant and irrelevant replies to better scope the LLM to our task. We consider a score of three or higher to be “relevant” and a score below three to be “irrelevant.” We note that while there may be more effective prompting strategies to improve the performance of LLMreasoning for this task (e.g., Chain-of-Thought prompting or self-reflection), the purpose of our work is to understand how applicable PressProtect’s abstraction of toxicity and relevance is to journalists’ usage of social media.
Relevance Technique | Precision | Recall | Accuracy | F1 |
---|---|---|---|---|
Title Keywords | 0.91 | 0.57 | 0.59 | 0.70 |
LDA | 0.95 | 0.12 | 0.25 | 0.21 |
GPT-3.5 | 0.86 | 0.75 | 0.74 | 0.80 |
Table4 shows how well each strategy performs on our test set in terms of precision, accuracy, and F1. Although LDA provides the highest precision, we find that it suffers from significantly low recall and heavily skews towards labeling replies as “irrelevant.” In contrast, both title keyword match and GPT-3.5 achieve high F1scores, with GPT-3.5 achieving an acceptable F1 of 0.8, suggesting a good balance between precision and recall on this task. As such, for the scope of our tool, we utilize GPT-3.5 for identifying tweets that are relevant to article texts in our system.
4.4. Content Presentation
To provide journalists safety and support in their engagement with potentially harmful content, we display comments to journalists differently based on their content categorizations. Our presentation of the comments is guided by G1-3, which states that journalists should be able to seamlessly access helpful, harmless reader responses they receive while being protected from harmful ones; to interact with harmful responses that are useful for their work without being exposed to unhelpful, harmful responses; and to access the full scope of responses if they so choose.To accomplish these, PressProtect introduces additional UI controls that provide the journalist greater agency in choosing when to encounter and interact with comments that are likely harmful. These controls are different depending on whether or not the likely-harmful comments might also be helpful to the journalist’s work, so the journalist can more safely interact with potentially harmful yet helpful comments if they are compelled to do so. Below, we detail how PressProtect presents the content by walking through its different screens (Figure2) with a user scenario.
User Scenario — Consider a journalist who recently tweeted about their coverage of a controversial issue, attracting a large volume of reader replies to their tweet that contains a mix of helpful and/or hateful content. The journalist can safely engage with those replies through PressProtect. First, assume the journalist wants to see readers’ constructive and likely-harmless feedback. To do so, they can click the “Show replies” button on the tweet from the homepage, surfacing the replies that PressProtect has classified as non-toxic (Panel 2 in Figure2). To foreground the replies that are more likely to be helpful to the journalist’s work, PressProtect also groups those classified as relevant at the top. The design of this screen is motivated by G1, as it provides a level of protection against the likely-harmful reader responses and helps the journalist to easily benefit from the utility provided by “good” comments (which might include sources, constructive critiques, and tips). We considered displaying the comments without grouping by relevance but decided against it, as this would not as effectively accomplish G1 given that relevant comments likely provide greater utility to journalists than irrelevant ones.
Next, assume the journalist wants to engage with critical feedback on their article. For example, they may want to understand what readers perceive to be gaps or bias in their coverage and anticipate that this feedback may be delivered in a heated way. To view these comments, they can click the “Show hidden replies” button, bringing them to the replies that PressProtect has classified as toxic but relevant (Panel 3 in Figure2). The design of this screen is motivated by G2, as it enables the journalist to interact with potentially harmful comments that could still be useful to them without needlessly being exposed to potentially harmful and unhelpful comments. We considered displaying all of the toxic comments on this screen at once, with the irrelevant ones displayed under the relevant ones, but ultimately chose not to; we determined that this would not as effectively accomplish G2 given that the journalist might end up being exposed to irrelevant and toxic comments without wanting to, reducing their control over exposure to potential harassment.
Finally, assume the journalist wants to view the full scope of reactions to their article — including the comments that are unhelpful and toxic. For example, they may simply be curious, or they may want to be informed of potentially threatening replies. To do so, the journalist can click the “Also show replies that are both irrelevant and toxic” toggle (Panel 4 in Figure 2). This aligns with G3, as it preserves the journalist’s ability to “view everything.”
While iterating on the design of PressProtect, we also considered adding visual indicators to the screen shown in Panel 1 of Figure2 to summarize the toxicity and relevance of the reader engagement for each tweet. We decided against this because it was unclear how we might summarize reader engagement accurately and meaningfully, especially in the context of how journalists reason about it in nuanced, granular ways. The exploration of designing effective feedback summarization in the presence of harmful content for various populations (e.g., journalists, politicians, professors, content creators) remains an important direction for future work.
5. User Testing for PressProtect
In this section, we present the results of a user study with eight journalist participants (P1-P8) who explored what audience engagement on social media would look like when mediated by PressProtect. Through user testing, we evaluate PressProtect’s effectiveness at addressing the needs we synthesized from our need-finding interviews and discuss further insights from participant feedback.
5.1. Study Design
To evaluate PressProtect, we recruited journalists who were: (1) active on Twitter/X and (2) authored tweets promoting their own stories that other accounts replied to. We decided on these criteria because we developed our interface to integrate with Twitter/X and because an evaluation of our interface would only be pertinent if there were replies that might be considered toxic or relevant. Two of our participants were also participants in our need-finding interviews. We otherwise recruited six additional participants by cold e-mailing journalists that satisfied these criteria and asked participants for referrals to colleagues that were likely fits. Seven of our participants tested our system populated with their own tweet data, and one participant instead tested with data from a close colleague in the same newsroom and on the same beat.
The user testing was semi-structured with questions that guided participants through the different interface screens, where we encouraged them to talk through their thoughts and reactions while interacting with each page. The full set of the structured questions is available in AppendixC. Because of the open-ended nature of these user tests, we asked additional questions when relevant. The user tests lasted between 20 to 45 minutes, varying based on the participants’ past exposure to online harassment, their insights with protective tools and tactics, and their industry experience. We discuss the limitations of conducting a time-boxed user study in Section6.3. Participants were compensated for their time with a $30 Amazon giftcard. Similar to the methods for the need-finding interviews (Section3), two researchers coded the transcripts of the interviews using thematic analysis and an initial set of categorical labels informed by the user test questions that were expanded as needed. We again use IRR as a process to reduce confirmation bias and achieved a Kupper-Hafner agreement metric of 0.70, indicating high agreement. The two researchers discussed and resolved these conflicts to determine the final codes for the data. The full codebook can be found in AppendixLABEL:a:user_test_codebook.
We do not report individual demographic information about our participants or disclose which newsrooms they are employed by because of the risk of deanonymization. However, the following are aggregated demographics information: three identified as men, and five identified as women; all of our participants are or were previously employed by major newsrooms, with the number of yearly subscribers to each newsroom ranging from hundreds of thousands to millions; the beats covered by our participants also varied — four covered technology, two covered the environment, one covered politics, and one covered sports.
5.1.1. Ethical Considerations
Because of the sensitive nature of this research, we informed all of our participants about the goal of this study before we began the interviews. Although we presented all but one of our participants with content that they had actually received on their Twitter/X accounts, we made sure to inform all participants of the potentially triggering nature of the content they were about to consume in our study. Additionally, we reminded participants that they could decline to answer any questions and were able to stop the interview at any time, and we obtained their consent to record the interviews. Because of both the online and offline threats to journalists’ safety, we treat the information our participants shared with us as highly sensitive. As such, we again remove PII from any interview excerpts included in the paper and only present aggregate descriptive statistics to protect participant anonymity. This follow-up work received approval from our IRB.
5.2. Findings
In this section, we present the findings from a series of user testing interviews of PressProtect with journalists who experienced significant harassment on Twitter/X. Through this evaluation, we find that participants:
- (1)
felt effectively protected by PressProtect against harassment and believed it could generalize to serve other visible and vulnerable groups,
- (2)
valued customization in automated tools,
- (3)
expected and made sense of errors in automated classification,
- (4)
did not feel valuable reader engagement was hindered by PressProtect,
- (5)
were keen of PressProtect’s application in other contexts, and
- (6)
surfaced a need for a tool to discern physical threat from other harassment.
5.2.1. Participants felt that PressProtect effectively protected them against harassment and could generalize to serve other online users in the public eye.
Throughout the interviews, it was apparent that preferences for dealing with online harassment varied — even within a curated set of journalists who had previously dealt with online harassment. Still, all participants agreed that the option to use a system like PressProtect would likely benefit many journalists. Four participants, including P8, added that this tool would be especially relevant to the experiences of marginalized-identity journalists or those that report on sensitive beats:
“I think journalists of color [would benefit from this tool], as well as female reporters—anyone in a marginalized community would get more harassment and more attacks than an average white dude…also, reporters covering issues like immigration, gender, racial justice, investigative reporters. Those are the groups of people who may benefit more from this tool than other reporters.” – P8
When examining what aspects of PressProtectparticipants appreciated, we found that participants saw value in the nuance of its underlying logic, which considers the relevance and harmfulness of comments without binarizing them as “good” or “bad”:
“[PressProtect] reminds me of [Block Party], the way it’s trying to filter responses and give you a little bit more control over what you’re looking at, so I think this could be useful.” –P1
“I really liked the fact that [PressProtect] separates [comments] into three groups instead of two. When you look at Twitter and some of their implicit tools, even ones that worked a little better a couple months ago before the takeover, to look at [what Twitter labeled as] harmful replies, it’s just based on language. They are all categorized as, ‘these are okay’, and ‘these are not,’ and I think [the different logic] is what is good about [PressProtect].” –P4
Similarly, participants enjoyed the ability to filter comments that were unlikely to lead to meaningful discussions using relevance. For instance, P5 explained how the relevance axis of PressProtect’s abstraction was actually the one they might personally find the most utility in:
“If I look at all the [tweets] that were hidden, they’re not tweets that I would have, under almost any circ*mstances, responded to—not necessarily just because they were harmful—but also because they weren’t helpful or real conversation topics…I think there are people who would emphasize different parts of this tool. For me, I think it’s more about filtering out irrelevant stuff.” – P5
More broadly, four of the participants felt that PressProtect’s underlying logic could also serve others that are required to maintain an online persona in the public eye, such as celebrities, politicians, or social media influencers.
5.2.2. Participants value the ability to customize automated tools for a wide range of personal preferences.
To understand the extent to which PressProtect’s axes of toxicity and relevance could represent individual journalist’s preferences, we investigated how each journalist defined these classifications. When asked to define what “toxicity” meant in the context of professional online engagement, our participants unsurprisingly gave various definitions. Some described toxic interactions as ones that involve insult, abuse, or identity-based harm, and others perceived toxic interactions as ones that were inherently irrelevant, describing them as as ones that did not “engage substantively with the story.” Others discussed how the scale of engagement could also make otherwise manageable interactions feel like harassment. Seven of our participants acknowledged that there are many factors that shape their personal thresholds for engaging online, such as past experiences of harassment, newsroom-dependent standards, and the various threat models they are vulnerable to (e.g., nation-state censorship, personal stalker).For instance, P7 described the variable degree of firsthand triaging journalists might have to undertake depending on how well-resourced their newsrooms were:
“If you’re in a newsroom that provides a lot of support, or doesn’t provide a lot of support, that definitely shapes the way that you would experience [harassment]. [My newsroom] provided a lot of support…Other newsrooms that I worked at wouldn’t provide anything, so there would be a much bigger emotional burden of needing to deal with it myself, needing to figure it out and triage what I’m receiving.” – P7
Consistent with intersectional feminist theory, four of our participants engaged with how various identity categories (e.g., gender, sexuality, race) shape harassment experiences, and as a result, their preferences for using social media.One participant explained their personal level of comfort with seeing most of the comments they receive comes from the degree of harm from identity attacks they anticipated as a cis male. They noted that other marginalized-identity journalists might have a different threshold for what content they are willing to expose themselves to by default:
“I want to be very clear: I am a cis male writer, so certainly I’m getting race-based attacks, but it’s different as a woman in the industry; it’s different for somebody who’s queer in the industry.”
Other facets described by our participants that influenced their thresholds included what past harassment experiences they had (e.g., if they had a story go viral, if they had been a target of nation-state censorship), or if their beat was particularly sensitive. The expectation that some areas of reporting attract more harassment than others was well-known to participants, and P7 elaborated on how this made such beats self-selecting for those with “thick skin”:
“For journalists who are on politically charged beats, like China, the alt-right, or Trump—these are beats that are known to attract a lot more harassment. There are some job posts that actually indicate that you need to have a thick skin because of the level of harassment the journalist receives. So, in a way…the more contentious beats actually self-select for people that can just deal with it. I wouldn’t say that everyone can.” – P7
Although participants felt that the axes of toxicity and relevance aptly characterized their engagement preferences for a given comment, the many facets of each journalist’s experience and identity shaped their individual thresholds for toxicity and relevance. Five of our participants felt that PressProtectwould better serve a wider range of journalists if they could customize these feature thresholds. For instance, two journalists suggested optimizing PressProtectfor a low false positive rate when on the job; three others proposed adding a finetuning mechanism for PressProtect’s classifications of toxicity and relevance to better align with their individual definitions. P1 elaborates on how they imagine finetuning an automated tool might integrate into their workflow:
“I think there’s definitely some sort of finetuning that could be done with the filtering. I think it could be potentially useful to allow the person using the tool to change the classification, if they feel like it’s incorrect. In the example that we talked about earlier, I was like, this person is being very polite in their tweet, but there’s a potential for harassment. It would be nice to be able to toggle that to your toxic replies and save that in a separate folder, so you can return to those accounts and see what they’re doing. Or, if you see a tweet that’s misclassified as toxic, and you’re like, Oh, this is actually a relevant reply, I want to move it back to my main replies, you could click into these little icons and change how they’re [classified]. I think that could be helpful.”
5.2.3. Participants expected and rationalized errors in automated classifications.
All participants acknowledged that any automated classification tool is bound to make decisions that deviate from their own. Seven participants described how PressProtect performed as well as or better than their expectations for an automated tool. Some categories of content that P5 perceived as out of scope or difficult for the system included jokes or sarcasm, which they also felt real people might struggle to discern:
“I don’t have a huge issue with the way that [replies are] categorized — even the joke tweets, but that would probably be a little bit of a blind spot [for the tool]. And again, frankly, that’s a blind spot for real people on the internet trying to figure out what’s going on too, so I certainly don’t expect any AI model to be able to 100% [categorize comments correctly].” – P5
Another participant felt that it was unreasonable to expect any automated tool to incorporate relevant context or a body of external knowledge about the story in its classifications:
“This one maybe is misclassified…I think this would be a really tricky thing to do automatically, because if someone is referencing a known public figure that’s involved in that story somehow but not discussed in the article, I don’t know how you would sort that or classify that without reading all the stories about this and knowing the context. So that makes sense to me, why it’s classified the way it is.” – P2
Still, the consensus among five of our participants was that these disagreements between the classifications and their own perceptions were an acceptable cost when compared to PressProtect’s benefits. For example, P4 imagined a workflow using PressProtect as a first-pass filter to decrease their manual triaging workload, a process they were already accustomed to with e-mail spam:
“I feel like [the risk of false positives] is a fair trade off for me…I’d be fine with having to take a little more time to sift through the hidden replies. At work, we have a tool that’s really similar to this, but for e-mail spam [where] most e-mails will come through, and then a couple e-mails will go to quarantine.” – P4
5.2.4. Participants did not feel PressProtect would hinder valuable interactions with readers.
Although in our need-finding interviews, we found that it was crucial that journalists be perceived as receptive to reader feedback, consistent with prior work (Lewis etal., 2020), all of our participants expressed a cynicism for the quality or value of engagement they were likely to receive on social media.Three participants specifically expressed distrust in the quality of interactions on the platform after Elon Musk’s acquisition, citing upticks in spam and abuse and the shutdown(Party, 2023b; Together, 2023) of third-party tools that had previously helped them navigate Twitter/X. P1, especially after Musk’s Twitter acquisition, adopted a personal policy to not engage on Twitter/X, and they felt that the professional value of Twitter/X replies had plummeted:
“I think there’s such limited value in a Twitter reply—even non-toxic and technically relevant Twitter replies for the most part—that anything that you would lose [by incorrectly categorizing content], you’re going to be losing something that would have been of only very marginal benefit anyway.” – P1
Additionally, P7 felt that even missing reader responses that were not harmful was not a critical issue for a protective tool:
“I’m not really worried about missing good comments. I receive so many inbound things every day, whether it’s a Twitter comment or LinkedIn message or an email, and my policy is I just get to what I get to. I’m not going to kill myself trying to reply to every single one, because it would just be way too much time, and I wouldn’t get any work done. So things like good comments that I don’t see and don’t get to engage with, I probably wouldn’t have engaged with it anyway.” – P7
As such, we observe that participants’ views of engagement integrity and quality were closely tied with their trust in the platform and the platform’s ability to effectively moderate content. While six participants acknowledged the drawbacks to mistakenly categorizing comments as toxic (i.e., hampering audience engagement in participatory journalism, missing a connection to a potential source), these participants were not deeply concerned that it would negatively impact their already unsatisfactory journalist-audience experience on Twitter/X.
5.2.5. Participants were keen on the application of PressProtect’s features in other contexts.
Several of the participants noted that the underlying logic of PressProtect could be leveraged or extended in other ways. Two participants specifically mentioned the potential of using this automated categorization mechanic as a triaging tool to help decrease the manual burden of safety teams in newsrooms. For instance, P7 explained how incorporating PressProtect into a triaging pipeline could allow journalists to assess risk without direct exposure to the content:
“I think if a newsroom had a protocol where they took responsibility and ownership over monitoring things for you, then there could be a really great benefit where [PressProtect] directs all of [the harmful comments] to the newsroom security team. So someone is seeing it, but not you. And then they can tell you what their risk assessment is without you having like been exposed to like the very direct toxicity.” – P7
The proposed use of PressProtect as a triaging tool mirrors the comment moderation pipeline explored by Moderator, which uses classifiers to group similar comments for easier review by human moderators(Jigsaw, 2016). Some other the potential ways to customize the tool discussed by our participants included the ability to do the following: share or offload the burden of triaging replies with another trusted entity (i.e., a colleague, similar to friendsourcing), toggle between sorting comments by relevance or in chronological order, or toggle the view of comment categories depending on if they are using Twitter/X professionally in that moment or are anticipating a flood of comments.
Participants also noted that PressProtect could serve journalists that do not have the resources and support of a dedicated newsroom security team, echoing the reflections of J3 on the need to democratize online harassment protection via third-party tools (Section3.2). P1 and P4 described that PressProtect’s logic could also benefit journalists when applied to e-mail and direct messages. While P4 noted that their newsroom provided filtering tools for e-mail, the filter’s purpose was to protect against spam and scam, rather than harassment. In addition, P8 suggested that PressProtect could be extended by integrating with platform moderation mechanisms, such as reporting and blocking, to streamline protections against state-sponsored harassment, which often leverages bot accounts.
5.2.6. Participants wanted a distinction between imminent, physical threat and other forms of online harassment.
As discussed above, participants expected and were generally not concerned with the ramifications of falsely labeling non-toxic and even relevant tweets, or false positives. However, a theme that emerged in five interviews was the importance of flagging urgent threats to their physical safety and not losing this signal by filtering such comments from default view or incorrectly classifying them as non-toxic. One participant was not concerned with “losing edge cases” for ad hominem verbal attacks, but they were concerned with (1) losing contact with a colleague or potential source or (2) not seeing a threat that could potentially escalate. Still, P1 felt that the tradeoff between the frequency of these events and the tool’s utility might be worthwhile:
“I think both of those scenarios [1 and 2] are pretty unlikely, so it’s a level of risk that would seem worth the benefit of having a lot of the most obnoxious stuff filtered out.” – P1
Others were less compromising on flagging such threats and the possibility of filtering them from view. Although P7 acknowledged that PressProtect could likely serve many journalists, P7’s frequent participation in public speaking events coupled with the smaller scale of engagement they receive on social media shaped their preferences for monitoring their online accounts; as a result, P7 preferred to filter nothing from their default comment view because identifying physical world threats took utmost priority over convenience and usability:
“I would just stop using [PressProtect] because of my bias towards wanting to see everything. I don’t necessarily care what order I see [comments]. I just want to, at some point, have seen them all…You still really need to monitor what people are saying to make sure that doesn’t translate into physical danger.” – P7
Two participants added that newsrooms struggle to combat online threat vectors to journalists’ safety. P7 explained how protecting journalists from online harassment has been a “huge weakness across the industry,” stating:
“There isn’t really a standardization or understanding of how to provide basic protections against digital harassment. There’s much more of an understanding of physical harassment, like getting you a bodyguard.”
However, to benefit from physical protections like a bodyguard, journalists must first be equipped to identify when their physical safety is threatened. This highlights a gap in the resources for assessing risk that must be addressed for journalists to take the necessary steps to obtain physical protections.Similarly, P8 detailed their experience with the failings of organizational and platform support when facing a state-sponsored online harassment campaigns:
“I just don’t want to engage with people on Twitter, period. But I think for me, it’s different because I’m more traumatized by state-sponsored harassment because that could lead to [feeling] physically unsafe. I feel violated and harassed when I see the state-sponsored [online] violence.”
“In my organization, they provide digital security security training [as a prevention tactic for online harassment], which I don’t think is very helpful, because I’m like, I know more than you do. And my situation is too unique for you to be able to give me a recommendation.”
“I had to go to our security people here at [my newsroom] and ask them [to report state-sponsored bots] because they know people at Twitter, and then they contacted whatever department at Twitter directly to ask them to monitor and remove the bots. I reported many times myself: nothing worked.”
P8’s experience underscored that a sense of online and offline safety can be deeply intertwined, and that in some cases, the harm from harassment is not simply contained to the message content. Rather, these campaigns publicly signal that these journalists are being monitored, causing them to self-censor and fear for their physical safety. Yet, both their newsroom and Twitter/X as a platform have not served their needs to identify and shut down these attacks. Additionally, the only avenues that were effective for P8 — (1) taking down state-sponsored accounts through a backchannel between their newsroom and employees at Twitter, and (2) self-censoring their online posts — are not necessarily desirable or accessible solutions for all journalists. This is especially true in the face of major changes in journalists’ attitudes toward and trust in the platform. We note that the ability to identify these dangers is not currently accounted for by PressProtect because it builds on the Perspective API’s toxicity clssifier, which does not disambiguate different kinds of harmful content, such as offensive language versus physical threat. We discuss this gap and its implications in more detail in Section 6.
6. Discussion
In this paper, we present the following contributions: (1) we formalized journalists’ unmet online safety needs, and (2) we developed an effective abstraction for journalists interfacing with social media in the face of significant harassment, using the axes of comment toxicity and comment relevance to their work. We discuss the implications of these findings on the design of anti-harassment tools and the roles of the stakeholders in the online journalism ecosystem.
6.1. Designing for Visible and Vulnerable Populations
To effectively design harassment protections for vulnerable communities, we must first make sense of the various dimensions of a community’s needs and how the nuances of these dimensions compare to those of other communitiesİn Sections3 and5, we investigated how journalists experience, process, and manage social media engagement in the face of significant harassment. Informed by our findings, we identify the following dimensions of journalists’ needs regarding online engagement and discuss the nuances of each:
Utility:The utility that journalists derive from online engagement (e.g., promoting their work, connecting to sources and their professional network, understanding reader responses) necessitate that journalists be active and visible on social media. Further, journalists want to view the work of their peers and readership opinions to inform their work. This centrality of visibility dictates that journalists may experience fewer benefits when restricting the visibility of their audience or their own accounts through the primary safety mechanisms offered by platforms (e.g., blocking or muting accounts).
External Constraints:Both the public and journalists’ own perceptions of journalistic professionalism reduce options for effective harassment mitigation, as journalists feel the need to appear open to criticism; we found that if journalists fail to maintain this public-facing image, they fear triggering accusations of being biased or sloppy, or pushing a conspiratorial political narrative. Upholding this receptive appearance does not require journalists actively reply to readers, but it does make existing moderation levers, like blocking accounts, unsuited for their professional norms. Furthermore, journalists’ usage of social media platforms is subject to news organizations’ explicit and implicit social media policies that encourage their active engagement in online spaces.While these policies can be ambiguous, violating them has cost journalists their jobs(Lee and Park, 2023).Because of these external constraints imposed by the public and their employers, journalists must cultivate “authentic”(Nelson, 2021) journalist-audience relationships in online spaces through open and active participation. While other populations, such as content creators or politicians, face similar reputational risks on social media, journalists must uniquely consider direct risks to their employment.
Safety:Journalists’ safety needs are shaped by the specific nuances of visibility and compulsory participation online. While platforms and some newsrooms provide resources for journalists facing imminent danger, the needs to (1) identify and document these threats and (2) monitor abusive patterns that may escalate to such threats fall upon journalists themselves to manage. Therefore, they must engage with online comments, even if they are harmful or triggering, to triage and escalate as necessary for their physical safety. Existing platform moderation mechanisms (e.g., blocking users, removing offensive content) binarize content as “benign” enough to be viewed or “harmful” enough to be filtered out completely; however, the nuances of journalists’ safety needs make these approaches impracticable and necessitate that platforms mediate journalists’ navigation of potentially harmful content in a safer way, rather than dispose of such content altogether.
Empowerment:Despite their intended purpose, platform abuse prevention levers such as blocking are not aligned with the journalists’ sense of agency and empowerment. Because platforms like Twitter/X indicate to users when they have been blocked, acknowledging and addressing harassment through blocking can instead empower abusers as positive feedback of successful harassment attempts. Other common strategies, such as disengaging or self-censoring, similarly disempower journalists and are at odds with the principle of press freedom. We observed this with one of our participants, who felt that a nation-state had succeeded in its attempt to silence them: “The intention [of the attack] is for me to stop doing what I do now and to be silenced precisely. And I think in some way, they’re successful because I’m no longer active on Twitter. I’m just like a bot promoting stories” (P8). As such, the nuances of journalists’ needs regarding empowerment in the face of online harassment have made existing strategies both ineffective and undesirable.
Sarah Jeong, a journalist whose social media controversy and subsequent harassment led her to quit her editorial position at The New York Times, encapsulates the interplay of these different dimensions in her experience with this flood of online attacks(Jeong, 2023):
“Twitter offers two tools to theoretically protect yourself, [blocking and muting]. Since the platform indicates when you’ve been blocked by a user, the Times asked me not to do it to anyone. Besides, the most motivated of my haters would make new accounts anyway. Large, influential accounts could drum up fresh troops to send forth into my replies…I wasn’t sure what was more unsettling: getting a death threat and seeing it, or not seeingit.”
Jeong’s experience was shaped by the nuances of the external constraints (from The Times, despite its past claim to “support the right of [their] journalists to mute or block people on social media who are threatening or abusive”(The NewYork Times, 2017)) and safety needs she had as a journalist, which ultimately made existing protective tools nonviable.Through this systematization of journalists’ needs, we argue that framing needs as multi-dimensional offers a lens for reasoning about the needs of other populations, particularly those that are also both visible and face outsized harassment online. Therefore, by considering the various dimensions of different communities’ needs, we can better design platform affordances and protections that actually address the needs of vulnerable groups.Furthermore, we argue for using this framing comparatively for groups. For instance, to what degree can PressProtect’s abstractions benefit groups such as social media influencers or politicians? How granular or broad are the groups that can share such abstractions? How similar are the needs between all visible and vulnerable groups? We argue that understanding different group needs in this manner can guide more thoughtful platform design for a diverse userbase.
Beyond a harm-centric research approach to supporting marginalized communities, this multi-dimensional framing may also be applied to the desires of such communities that will enable them to flourish, rather than simply survive. Recent work in the interactive design space has highlighted the importance of fulfilling the desires of marginalized communities, particularly Black, Indigenous, and People of Color (BIPOC), to find joy, affirmation, and liberation in online spaces(To etal., 2023).Understanding the nuances of these desires (e.g., belongingness as a nuance of a desire to be empowered, cultural heritage as a nuance of a desire to express oneself) can likewise better inform designs that allow marginalized communities to flourish in online spaces.
6.2. Roles of Stakeholders
6.2.1. Newsrooms
Prior work in the computer security community has assumed that journalists (1) have viable alternatives to seeking audience engagement and (2) are effectively supported by their organizations to stay safe online (Samermit etal., 2023). However, our findings (Section5) indicate that journalists are often subject to newsrooms’ social media policies that require their participation, yet do not feel that these newsrooms adequately support them against online harassment.Our participants discerned that protections offered by newsrooms focus on addressing physical rather than online harassment. To effectively leverage these protections, however, journalists themselves must identify and document when threats become physical, manufacturing a safety need for journalists — or in the case of well-resourced newsrooms, their safety teams — to be able to monitor and triage high volumes of engagement for dangerous behaviors. We believe that the use of PressProtect’s abstraction to make sense of comments’ harmfulness and helpfulness, coupled with the development of automated tools to identify physical threats, can help journalists and newsroom safety teams to more effectively make sense of large-scale harassment.
A more fundamental tension underlying journalists’ experience of online harassment comes from an industry shift toward social media as the primary avenue for reciprocal journalism(Lewis etal., 2014). The literature in journalism practice defines reciprocal journalism as a concept that positions journalists as community organizers to build trust and social capital with their audience. However, the theory of reciprocity requires mutual “gift-giving” which, in this context, manifests as valuable thoughts, perceptions, or behaviors(Lewis etal., 2014). We observed that journalists lost faith in Twitter/X’s capabilities to: (1) enact effective platform moderation and (2) foster a userbase with “gifts” to give. We argue that, as it stands today, Twitter/X’s ecosystem has thereby made reciprocal journalism nonviable in practice for many journalists. In light of existing platform and organization limitations, we recommend that newsrooms reconsider both the explicit and implicit expectations that are imposed on their staff to better center the needs of their most vulnerable journalists — who are often those of marginalized identities — paralleling design recommendations made by the CSCW community(Ashktorab and Vitak, 2016; Schoenebeck etal., 2023; Scheuerman etal., 2018; Blackwell etal., 2017; Han etal., 2023).
6.2.2. Social Media Platforms
Twitter/X has positioned itself as a platform where “the public has access to reliable information from trusted sources,” even publishing a guide to “how journalists can use Twitter safely” with suggestions to use its native tools for addressing account security and online harassment threats(Twitter, [n. d.]).However, both prior work(UNESCO, 2021) and our data indicate that journalists view social media platforms as the primary channels for and enablers of online violence; moreover, the tenuous protections offered by existing platform safety tools have made journalists continue to feel unsafe.We observed how journalists have lost trust in the platform, especially after Musk’s acquisition of Twitter, to counter abuse despite its proclaimed efforts, as well as in the platform’s users to engage in high-quality interactions. As Twitter/X has become an inhospitable ecosystem for reciprocal journalism, we argue that if platforms wish to re-position themselves as central, usable, and useful to journalists, they must address this issue of lost trust.Prior work in information systems management has identified benevolence trust (i.e., trust that a platform will act in goodwill rather than opportunistically) as the most significant factor in users’ continuation on a social media platform(Chen Wu etal., 2013).Drawing from this,we argue more broadly that designing for vulnerable users can help re-build trust with those that have experienced harm and thereby aligns with platforms’ incentives; it behooves platforms to prioritize building and maintaining trust with its diverse user groups to sustain their platform usage.
Beyond the design of platform affordances and safety measures, we highlight the unrealized potential for third-party developers in this ecosystem. Past CSCW work has discussed how community or third-party developers are uniquely positioned to rapidly build platform tools to address vulnerable communities’ needs because they are unhindered by the frictions that platforms face when developing features or policies(Han etal., 2023). While most of our participants did not use third-party tools to manage harassment, several of our participants mentioned using or knowing about Block Party. Block Party, however, is one of many third-party Twitter/X applications that shut down due to Musk’s imposition of prohibitive API prices and degraded API access(Binder, 2023).Between Twitter/X and Reddit, this trend of stymieing developer access to platforms is a threat to the work of both third-party developers and researchers(Calma, 2023).Although Twitter/X has developed safety features (e.g., blocking and muting accounts, muting notifications), we demonstrate how these fail to meet the needs of journalists, and likely those of other visible and vulnerable populations. We argue that instead embracing the distinct role of external developers in platform and community governance will: (1) rebuild trust between platforms and their users, which will (2) enable reciprocal social media exchanges, and (3) democratize access to effective community safety tools. We situate the combined strengths of centralized (platform) and decentralized (end user, third-party developers, newsrooms) actors as a case that multi-level governance systems(Jhaver etal., 2023) may better suit governance in online social media platforms that have a diverse user population with vulnerable groups.
6.3. Limitations
Journalists are not a monolithic group: even within both of our studies, each with eight participants, we observed a large range of personal experiences, identities, and preferences for engaging online. Thus, the approach of finitely distilling the needs of a group, such as journalists, is inherently limited and cannot address all of the nuanced needs of every member.
In addition, both our need-finding and user testing studies are limited by the scope and scale of our recruitment processes. In this work, we began the participatory design process with AAPI journalists that self-identified as having experienced online harassment and self-selected to participate in our study. Therefore, we do not claim that our participant samples are representative of all journalists. Still, consistent with Blackwell et al.’s conclusion that centering the needs of marginalized groups will produce a design process that better serves all users, we found that our need-finding process with AAPI journalists yielded a set of design goals that, when actualized in PressProtect’s implementation, effectively protected a broader population of journalists, as our user testing participants were not recruited from a specific identity group. During user testing, we note that one journalist surfaced the additional need to discern physical threats from other kinds of harassment. Although participants in our initial need-finding study discussed experiences where online harassment had escalated to physical threats and the impacts of these experiences, the need to identify physical threats was not clearly articulated by any participant. This may be because of the open-ended nature of our semi-structured interview questions, which did not prompt participants specifically about online safety needs; on the other hand, with the structure and context provided by our card sorting exercise of inflammatory tweet mentions, the restrictiveness of closed card sorting may not have allowed participants to discuss alternative ways they would have liked to process harmful content.
We conducted our user testing study as a time-boxed exploration of PressProtect’s interface with a small, curated set of participants’ past tweet data. This approach is limited in that it fails simulate journalists’ in situ usage of PressProtect as a long-term solution. When we set out to build PressProtect in February2023, we envisioned deploying it as a real-world tool that integrated with Twitter/X through its API, hooking into the notification system of the platform itself as the mechanism for controlling the journalist’s exposure to reader engagement. However, as access to the Twitter/X API degraded and became prohibitively expensive just weeks later(X, 2023),PressProtect’s deployment as a usable application became impossible. As a result, our goal shifted to understanding the effectiveness of PressProtect’s underlying abstraction to reason about content in a way that is helpful to journalists, rather than to evaluate a real-world system’s performance or usability in practice. We hope that the insights from our design exploration with PressProtect can be integrated into future platform or tool designs.
7. Conclusion
In this work, we formalized the outstanding needs of journalists to safely participate in online spaces when facing significant harassment, drawing from the findings of our eight need-finding interviews. We synthesized a set of design goals to make journalists’ social media experience safer and more usable, from which we built PressProtect, an interface for Twitter/X that leverages a logical abstraction for reader comments and uses UI controls for viewing comments as its protective mechanism. Through user testing with a separate set of eight journalists, we demonstrated that PressProtect’s abstraction and protective mechanism effectively served journalists; however, user testing also revealed a previously unidentified need for journalists to be able to discern physical threats from other harassment. Our findings deepen our understanding of vulnerable communities’ needs as multi-dimensional and central to safe, effective platform design and suggest opportunities for collaboration between news organizations, platforms, and third-party developers to better achieve multi-level governance.
8. Acknowledgements
We thank our participants for their time and openness in sharing their experiences with us and providing formative feedback to refine the design of PressProtect. We are also grateful to Waliya Lari and AAJA for connecting us to their community. We thank Michael Bernstein for his insight on the framing and design of our need-finding study and Marissa Henri for her help designing PressProtect’s user interface. This work was supported in part by a Magic Grant from the Brown Institute for Media Innovation, a Sloan Research Fellowship, an NSF Graduate Research Fellowship #DGE-1656518, and NSF grants #2030859 and #2127309 to the Computing Research Association for the CIFellows Project.
References
- (1)
- per (2023)2023.Perspective API.https://perspectiveapi.com/.
- Ashktorab and Vitak (2016)Zahra Ashktorab andJessica Vitak. 2016.Designing cyberbullying mitigation and preventionsolutions through participatory design with teenagers. InProceedings of the 2016 CHI conference on humanfactors in computing systems.
- Avery E.Holton and Molyneux (2023)DianaBossio Avery E.Holton, ValérieBélair-Gagnon and Logan Molyneux.2023.“Not Their Fault, but Their Problem”:Organizational Responses to the Online Harassment of Journalists.Journalism Practice 17,4 (2023), 859–874.
- Bellini (2023)Rosanna Bellini.2023.Paying the Price: When Intimate Partners UseTechnology for Financial Harm. In Proceedings ofthe 2023 CHI Conference on Human Factors in Computing Systems.1–17.
- Binder (2023)Matt Binder.2023.Twitter’s API keeps breaking, even for developerspaying $42,000.Mashable (2023).https://mashable.com/article/twitter-api-elon-musk-developer-issues-apps
- Blackwell etal. (2018)Lindsay Blackwell,Tianying Chen, Sarita Schoenebeck, andCliff Lampe. 2018.When online harassment is perceived as justified.In Proceedings of the Twelfth International AAAIConference on Web and Social Media (ICWSM 2018).AAAI, Menlo Park, CA, USA,22–31.
- Blackwell etal. (2017)Lindsay Blackwell, JillDimond, Sarita Schoenebeck, and CliffLampe. 2017.Classification and Its Consequences for OnlineHarassment: Design Insights from HeartMob.Proc. ACM Hum.-Comput. Interact.1, CSCW, Article 24(Dec. 2017), 19pages.https://doi.org/10.1145/3134659
- Cai etal. (2023)Jie Cai, SagnikChowdhury, Hongyang Zhou, andDongheeYvette Wohn. 2023.Hate Raids on Twitch: Understanding Real-TimeHuman-Bot Coordinated Attacks in Live Streaming Communities.Proc. ACM Hum.-Comput. Interact.CSCW (2023).
- Calma (2023)Justine Calma.2023.Twitter just closed the book on academic research.The Verge (2023).https://www.theverge.com/2023/5/31/23739084/twitter-elon-musk-api-policy-chilling-academic-research
- Chen etal. (2020)GinaMasullo Chen,Paromita Pain, VictoriaY Chen,Madlin Mekelburg, Nina Springer, andFranziska Troger. 2020.‘You really have to have a thick skin’: Across-cultural perspective on how online harassment influences femalejournalists.Journalism 21,7 (2020), 877–895.
- Chen Wu etal. (2013)Cou Chen Wu, YvesHuang, and Chia Lin Hsu.2013.Benevolence trust: a key determinant of usercontinuance use of online social networks.Information Systems and e-BusinessManagement (2013).
- Collins and Bilge (2016)PatriciaHill Collins andSirma Bilge. 2016.Intersectionality.John Wiley & Sons, Hoboken,NJ.
- Crenshaw (1991)Kimberle Crenshaw.1991.Mapping the Margins: Intersectionality, IdentityPolitics, and Violence against Women of Color.43, 6 (1991),1241–1299.
- Deavours etal. (2023)Danielle Deavours, WillHeath, Kaitlin Miller, Misha Viehouser,Sandra Palacios-Plugge, and RyanBroussard. 2023.Reciprocal journalism’s double-edged sword: Howjournalists resolve cognitive dissonance after experiencing harassment fromaudiences on social media.Journalism 24,11 (2023), 2454–2473.
- Ferrier and Garud-Patkar (2018)Michelle Ferrier andNisha Garud-Patkar. 2018.TrollBusters: Fighting online harassment of womenjournalists.Mediating misogyny: Gender, technology, andharassment (2018).
- Gilbert (2023)Sarah Gilbert.2023.Towards intersectional moderation: An alternativemodel of moderation built on care and power.Proceedings of the ACM on Human-ComputerInteraction (2023).
- Gorrell etal. (2018)Genevieve Gorrell, MarkGreenwood, Ian Roberts, Diana Maynard,and Kalina Bontcheva. 2018.Twits, twats and twaddle: Trends in online abusetowards uk politicians. In Proceedings of theInternational AAAI Conference on Web and Social Media.
- Goyal etal. (2022)Nitesh Goyal, LesliePark, and Lucy Vasserman.2022.”You have to prove the threat is real”:Understanding the needs of Female Journalists and Activists to Document andReport Online Harassment. In Proceedings of the2022 CHI Conference on Human Factors in Computing Systems.
- Han etal. (2023)Catherine Han, JosephSeering, Deepak Kumar, JeffreyT.Hanco*ck, and Zakir Durumeric.2023.Hate Raids on Twitch: Echoes of the Past, NewModalities, and Implications for Platform Governance.Proc. ACM Hum.-Comput. Interact.CSCW (2023).
- Holton etal. (2023)AveryE Holton,Valérie Bélair-Gagnon, DianaBossio, and Logan Molyneux.2023.“Not their fault, but their problem”:Organizational responses to the online harassment of journalists.Journalism Practice (2023).
- Hua etal. (2020)Yiqing Hua, Mor Naaman,and Thomas Ristenpart. 2020.Characterizing twitter users who engage inadversarial interactions against political candidates. InProceedings of the 2020 CHI conference on humanfactors in computing systems.
- Jeong (2023)Sarah Jeong.2023.Goodbye to all that harassment.The Verge (2023).https://www.theverge.com/c/features/23997516/harassment-twitter-sarah-jeong-canceled-social-change
- Jhaver etal. (2022)Shagun Jhaver, QuanZeChen, Detlef Knauss, and AmyX Zhang.2022.Designing word filter tools for creator-led commentmoderation. In Proceedings of the 2022 CHIConference on Human Factors in Computing Systems.
- Jhaver etal. (2023)Shagun Jhaver, Seth Frey,and AmyX. Zhang. 2023.Decentralizing Platform Power: A Design Space ofMulti-Level Governance in Online Social Platforms.Social Media + Society 9,4 (2023).
- Jhaver etal. (2018)Shagun Jhaver, SuchetaGhoshal, Amy Bruckman, and EricGilbert. 2018.Online Harassment and Content Moderation: The Caseof Blocklists.ACM Trans. Comput.-Hum. Interact.25, 2, Article 12(March 2018), 33pages.https://doi.org/10.1145/3185593
- Jigsaw (2016)Jigsaw. 2016.New York Times and Jigsaw Partner to ScaleModeration Platform.Technical Report. Jigsaw.https://medium.com/jigsaw/new-york-times-and-jigsaw-partner-to-scale-moderation-platform-7959b698a562
- Krook and Sanín (2020)MonaLena Krook andJulianaRestrepo Sanín.2020.The cost of doing politics? Analyzing violence andharassment against female politicians.Perspectives on Politics(2020).
- Kumar etal. (2023)Deepak Kumar, JeffHanco*ck, Kurt Thomas, and ZakirDurumeric. 2023.Understanding the behaviors of toxic accounts onreddit. In Proceedings of the ACM Web Conference2023.
- Kumar etal. (2021)Deepak Kumar, PatrickGageKelley, Sunny Consolvo, Joshua Mason,Elie Bursztein, Zakir Durumeric,Kurt Thomas, and Michael Bailey.2021.Designing toxic content classification for adiversity of perspectives. In SeventeenthSymposium on Usable Privacy and Security (SOUPS 2021).
- Kumar etal. (2017)Srijan Kumar, JustinCheng, Jure Leskovec, and V.S.Subrahmanian. 2017.An Army of Me: Sockpuppets in Online DiscussionCommunities. In WWW ’17.
- Kupper and Hafner (1989)LawrenceL. Kupper andKerryB. Hafner. 1989.On Assessing Interrater Agreement for MultipleAttribute Responses.45, 3 (1989),957–967.
- Lee and Park (2023)NaYeon Lee and AhranPark. 2023.How online harassment affects Korean journalists?The effects of online harassment on the journalists’ psychological problemsand their intention to leave the profession.Journalism 0,0 (2023),14648849231166511.
- Lewis etal. (2014)SethC. Lewis, AveryE.Holton, and Mark Coddington.2014.Reciprocal Journalism: A concept of mutual exchangebetween journalists and audiences. In JournalismPractice.
- Lewis etal. (2020)SethC Lewis, RodrigoZamith, and Mark Coddington.2020.Online harassment and its implications for thejournalist–audience relationship.Digital Journalism (2020).
- Lorenz (2023)Taylor Lorenz.2023.https://www.washingtonpost.com/investigations/2023/02/14/women-journalists-global-violence/.
- Mahar etal. (2018)Kaitlin Mahar, AmyXZhang, and David Karger.2018.Squadbox: A tool to combat email harassment usingfriendsourced moderation. In Proceedings of the2018 CHI Conference on Human Factors in Computing Systems.
- Marsden (2017)Maeve Marsden.2017.In defence of the internet ’pile on’.The Sunday Morning Harold(2017).https://www.smh.com.au/lifestyle/in-defence-of-the-internet-pile-on-20170621-gwvh2z.html
- Marwick (2021)AliceE Marwick.2021.Morally Motivated Networked Harassment asNormative Reinforcement.Social Media + Society 7,2 (2021),20563051211021378.https://doi.org/10.1177/20563051211021378
- McDonald etal. (2021)Allison McDonald,Catherine Barwulor, MichelleL Mazurek,Florian Schaub, and ElissaM Redmiles.2021.” It’s stressful having all these phones”:Investigating Sex Workers’ Safety Goals, Risks, and Practices Online. In30th USENIX Security Symposium (USENIX Security21).
- McDonald etal. (2019)Nora McDonald, SaritaSchoenebeck, and Andrea Forte.2019.Reliability and Inter-Rater Reliability inQualitative Research: Norms and Guidelines for CSCW and HCI Practice.Proc. ACM Hum.-Comput. Interact.3, CSCW, Article 72(Nov. 2019), 23pages.https://doi.org/10.1145/3359174
- Nadim and Fladmoe (2021)Marjan Nadim and AudunFladmoe. 2021.Silencing women? Gender and online harassment.Social Science Computer Review(2021).
- Nelson (2021)JacobL. Nelson.2021.A Twitter tightrope without a net: Journalists’reactions to newsroom social media policies.Columbia Journalism Review(2021).https://www.cjr.org/tow_center_reports/newsroom-social-media-policies.php
- Nelson (2023)JacobL Nelson.2023.“Worse than the Harassment Itself.”Journalists’ Reactions to Newsroom Social Media Policies.Digital Journalism (2023).
- Party (2023a)Block Party.2023a.https://www.blockpartyapp.com/.
- Party (2023b)Block Party.2023b.Block Party’s Twitter product is on indefinitehiatus as of May 31.Retrieved July 25, 2023 from https://www.blockpartyapp.com/blog/twitter-hiatus/
- Posetti etal. (2022)Julie Posetti, KalinaBontcheva, and Nabeelah Shabbir.2022.The Chilling: assessing big tech’s responseto online violence against women journalists.Technical Report. UNESCO,France.
- Powell etal. (2020)Anastasia Powell, AdrianJScott, and Nicola Henry.2020.Digital harassment and abuse: Experiences ofsexuality and gender minority adults.European journal of criminology(2020).
- Quodling (2015)Andrew Quodling.2015.Doxxing, swatting and the new trends in onlineharassment.The Conversation (2015).https://theconversation.com/doxxing-swatting-and-the-new-trends-in-online-harassment-40234
- Salehi etal. (2023)Niloufar Salehi, RoyaPakzad, Nazita Lajevardi, and MariamAsad. 2023.Sustained Harm Over Time and Space Limits theExternal Function of Online Counterpublics for American Muslims.Proceedings of the ACM on Human-ComputerInteraction (2023).
- Samermit etal. (2023)Patrawat Samermit, AnnaTurner, PatrickGage Kelley, TaraMatthews, Vanessia Wu, Sunny Consolvo,and Kurt Thomas. 2023.“Millions of people are watching you”:Understanding the Digital-Safety Needs and Practices of Creators. In32nd USENIX Security Symposium (USENIX Security23).
- Saunders etal. (2018)Benjamin Saunders, JuliusSim, Tom Kingstone, Shula Baker,Jackie Waterfield, Bernadette Bartlam,Heather Burroughs, and Clare Jinks.2018.Saturation in qualitative research: exploring itsconceptualization and operationalization.Qual Quant 52,4 (2018), 1893–1907.
- Saveski etal. (2021)Martin Saveski, BrandonRoy, and Deb Roy. 2021.The structure of toxic conversations on Twitter.In ACM Web Conference.
- Scheuerman etal. (2018)MorganKlaus Scheuerman,StacyM. Branham, and Foad Hamidi.2018.Safe Spaces and Safe Places: UnpackingTechnology-Mediated Experiences of Safety and Harm with Transgender People.Proc. ACM Hum.-Comput. Interact.2, CSCW, Article 155(Nov. 2018), 27pages.https://doi.org/10.1145/3274424
- Schoenebeck etal. (2023)Sarita Schoenebeck, AmnaBatool, Giang Do, Sylvia Darling,Gabriel Grill, Daricia Wilkinson,Mehtab Khan, Kentaro Toyama, andLouise Ashwell. 2023.Online Harassment in Majority Contexts: ExaminingHarms and Remedies across Countries. InProceedings of the 2023 CHI Conference on HumanFactors in Computing Systems.
- Sjøvaag (2018)Helle Sjøvaag.2018.Oxford Research Encyclopedia ofCommunication.Oxford University Press, Chapter Journalism’sSocial Contract.
- Tandoc etal. (2023)EdsonC Tandoc, KarrylKimSagun, and KatrinaPaola Alvarez.2023.The digitization of harassment: Womenjournalists’ experiences with online harassment in the Philippines.Journalism Practice (2023).
- The NewYork Times (2017)The NewYork Times.2017.The Times Issues Social Media Guidelines for theNewsroom.The New York Times(2017).https://www.nytimes.com/2017/10/13/reader-center/social-media-guidelines.html
- Thomas etal. (2021)Kurt Thomas, DevdattaAkhawe, Michael Bailey, Dan Boneh,Sunny Consolvo, Nicola Dell,Zakir Durumeric, PatrickGage Kelley,Deepak Kumar, Damon McCoy,Sarah Meiklejohn, Thomas Ristenpart,and Gianluca Stringhini.2021.SoK: Hate, Harassment, and the Changing Landscapeof Online Abuse. In IEEESP.
- Thomas etal. (2022)Kurt Thomas, PatrickGageKelley, Sunny Consolvo, PatrawatSamermit, and Elie Bursztein.2022.“It’s common and a part of being a contentcreator”: Understanding How Creators Experience and Cope with Hate andHarassment Online. In Proceedings of the 2022 CHIConference on Human Factors in Computing Systems.
- To etal. (2023)Alexandra To, Angela D.R.Smith, Dilruba Showkat, AdinawaAdjagbodjou, and Christina Harrington.2023.Flourishing in the Everyday: Moving BeyondDamage-Centered Design in HCI for BIPOC Communities. InProceedings of the 2023 ACM Designing InteractiveSystems Conference (Pittsburgh, PA, USA) (DIS’23). Association for Computing Machinery,New York, NY, USA, 917–933.
- Together (2023)Block Together.2023.https://blocktogether.org.
- Twitter ([n. d.])Twitter.[n. d.].How journalists can use Twitter safely.https://business.twitter.com/en/blog/how-journalists-can-use-twitter-safely.html.Accessed: 2024-01-16.
- UNESCO (2021)UNESCO. 2021.Journalism is a public good: world trendsin freedom of expression and media development: global report 2021/2022;Highlights.Technical Report. UNESCO,Paris, France.
- Vitak etal. (2017)Jessica Vitak, KalyaniChadha, Linda Steiner, and ZahraAshktorab. 2017.Identifying women’s experiences with and strategiesfor mitigating negative effects of online harassment. InProceedings of the 2017 ACM Conference on ComputerSupported Cooperative Work and Social Computing.
- Wagner (2022)Angelia Wagner.2022.Tolerating the trolls? Gendered perceptions ofonline harassment of politicians in Canada.Feminist Media Studies(2022).
- X (2023)X. 2023.https://twitter.com/XDevelopers/status/1621026986784337922.Accessed: 2024-01-16.
- Xia etal. (2020)Yan Xia, Haiyi Zhu,Tun Lun, Peng Zhang, andNing Gu. 2020.Exploring antecedents and consequences of toxicityin online discussions: A case study on Reddit.CSCW (2020).
- Xiao etal. (2022)Sijia Xiao, CoyeCheshire, and Niloufar Salehi.2022.Sensemaking, support, safety, retribution,transformation: A restorative justice approach to understandingadolescents’ needs for addressing online harm. InProceedings of the 2022 CHI Conference on HumanFactors in Computing Systems.
Appendix A Need-finding Interview Questions
A.1. Questions about demographics and Internet usage
- •
How long have you been a journalist?
- •
What types of stories do you usually cover?
- •
What social media platforms do you regularly use?
- •
What do you primarily use social media for?
- •
Is social media usage a core part of your professional career?
- •
What social media platforms are most important to you and your professional career?
A.2. Questions about harassment experiences, harms, and defenses
- •
Can you tell me about a time when you experienced online hate and harassment on Twitter?
- •
What was the experience like? What happened, and how was it triggered?
- •
What were the types of messages that were sent to you? How were they sent – via a DM? A mention?