<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "http://jats.nlm.nih.gov/publishing/1.0/JATS-journalpublishing1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" article-type="research-article" xml:lang="en">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">IR</journal-id>
<journal-title-group>
<journal-title>Information Research</journal-title>
</journal-title-group>
<issn pub-type="epub">1368-1613</issn>
<publisher>
<publisher-name>University of Bor&#x00E5;s</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">ir30iConf47197</article-id>
<article-id pub-id-type="doi">10.47989/ir30iConf47197</article-id>
<article-categories>
<subj-group xml:lang="en">
<subject>Research article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>&#x2018;Everyone has their reasons for curating the data they have decided to keep&#x2019;: a thematic analysis of data hoarding as digital curation practice</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author"><name><surname>Maemura</surname><given-names>Emily</given-names></name>
<xref ref-type="aff" rid="aff0001"/></contrib>
<contrib contrib-type="author"><name><surname>Wagner</surname><given-names>Travis L.</given-names></name>
<xref ref-type="aff" rid="aff0002"/></contrib>
<aff id="aff0001"><bold>Emily Maemura</bold> is Assistant Professor at the School of Information Sciences, University of Illinois Urbana-Champaign, USA. She completed her PhD at the University of Toronto&#x2019;s Faculty of Information. Her research focuses on data practices and the activities of curation, description, characterization, and re-use of archived web data. She can be contacted at <email xlink:href="emaemura@illinois.edu">emaemura@illinois.edu</email></aff>
<aff id="aff0002"><bold>Travis L. Wagner</bold> is Assistant Professor at the School of Information Sciences, University of Illinois Urbana-Champaign, USA. They completed their PhD at the University of South Carolina&#x2019;s School of Information Science. Their research focuses on the sociotechnical relationships between archival artifacts, digital curation technologies, and community representation, with an emphasis on its impact on LGBTQIA+ communities. They can be contacted at <email xlink:href="wagnert@illinois.edu">wagnert@illinois.edu</email></aff>
</contrib-group>
<pub-date pub-type="epub"><day>06</day><month>05</month><year>2025</year></pub-date>
<pub-date pub-type="collection"><year>2025</year></pub-date>
<volume>30</volume>
<issue>i</issue>
<fpage>789</fpage>
<lpage>797</lpage>
<permissions>
<copyright-year>2025</copyright-year>
<copyright-holder>&#x00A9; 2025 The Author(s).</copyright-holder>
<license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by-nc/4.0/">
<license-p>This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (<ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by-nc/4.0/">http://creativecommons.org/licenses/by-nc/4.0/</ext-link>), permitting all non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
</license>
</permissions>
<abstract xml:lang="en">
<title>Abstract</title>
<p><bold>Introduction.</bold> This paper presents a preliminary analysis of the r/DataHoarder subreddit, an online community focused on social, legal, and technical practices related to digital curation. We contrast how they conceptualize their practices with the models, frameworks, and capabilities established for digital curation in more traditional memory institutions.</p>
<p><bold>Method.</bold> We use thematic analysis to analyse top posts (n=170) from the subreddit, to determine how members conceptualize, describe, and enact their data hoarding practices.</p>
<p><bold>Findings.</bold> Two key themes are identified: a focus on materiality of storage for hoarding, and use of the subreddit to promote calls-to-action. Each theme is further analysed, identifying underlying motivations such as nostalgia and vigilantism.</p>
<p><bold>Discussion.</bold> We briefly discuss the theoretical and practical implications of these findings in contrast with workflows like the DCC curation lifecycle, while also addressing limitations of this preliminary work, and outlining potential new research directions moving forward.</p>
<p><bold>Conclusion.</bold> We have found this analysis presents useful counterpoints to commonly referenced and standardised practices of memory institutions. We believe this will continue to be a fruitful area of study as we conduct future work on how r/DataHoarder and similar communities conceptualise, practice, and develop their own ethical framework toward sustained access of digital materials. </p>
</abstract>
</article-meta>
</front>
<body>
<sec id="sec1">
<title>Introduction</title>
<p>The r/DataHoarder subreddit is an online community focused on social, legal, and technical practices related to digital curation. Created in May 2013 and with a current membership of over 700,000 users, it is neither representative of any institution nor defined by a shared digital curation credo. Yet, r/DataHoarder nonetheless defines itself as a community of <italic>&#x2018;digital librarians&#x2019;</italic> who have varied <italic>&#x2018;reasons for curating the data they have decided to keep (either forever or For A Damn Long Time<sup>tm</sup>)&#x2019;</italic> (<xref rid="R27" ref-type="bibr">r/DataHoarder, 2017</xref>). Regardless of type of data or reasons for hoarding, r/DataHoarder members share best practices, technical guidance, and, on occasion, ethical stances towards the increasingly complex sociotechnical landscape of a digital-first world. As such, r/DataHoarder acts as a distributed community of practice, enacting digital curation through deinstitutionalized and extralegal means.</p>
<p>A recent example of r/DataHoarder&#x2019;s collective efforts is seen in their archiving work in the wake of the January 6th United States Capitol attack. The subreddit&#x2019;s users collaborated to document and preserve footage from the attack as well as communications between those involved, which received both scholarly and journalistic attention (<xref rid="R1" ref-type="bibr">Basu, 2021</xref>; Chapman, 2022). This example demonstrates how, despite a distributed community structure, r/DataHoarder&#x2019;s members also coalesce around shared ethics and practices. We take an interest in exploring these motivations, where they both align with professional guidelines, or alternately exhibit distrust for traditional collecting institutions (i.e., government agencies), and how they work intentionally outside these institutional boundaries.</p>
<p>This study therefore examines how r/DataHoarder imagines and actualizes digital curation practice. We consider how such framings align with and depart from standards and practices of digital curation within traditional memory institutions. We present a preliminary thematic analysis of the top posts which have a vote count of 2,000 or more (Panek, 2021). We present this exploratory study to address the following research questions:
<list list-type="bullet">
<list-item><p>What is the practice of <italic>&#x2018;data hoarding&#x2019;</italic> and what activities and capabilities does it entail?</p></list-item>
<list-item><p>Where do these activities and capabilities intersect with or diverge from those of memory institutions&#x2019; practices of curation?</p></list-item>
<list-item><p>What do these practices reveal of the ethics and social ideologies underlying data hoarding work, and motivations of individual data-hoarders?</p></list-item>
</list></p>
</sec>
<sec id="sec2">
<title>Literature review</title>
<p>Over the past three decades, scholarship in digital preservation and curation has produced a knowledge base of best practices, primarily focused on conducting this work within organizational settings. This includes the standardization of preservation systems and functions through the open archival information system reference model, as well as common practices and activities as referenced in the digital preservation coalition (DPC) handbook and outlined in the digital curation centre (DCC) lifecycle. The specific needs for organizational and individual skillsets in conducting preservation and curation have also been studied, prescribed, and measured through capability models and tools like the NDSA levels of preservation, DPC competency audit tool, and DLF&#x2019;s levels of born-digital access (NDSA, 2019; <xref rid="R20" ref-type="bibr">McMeekin &#x0026; Currie, 2022</xref>; <xref rid="R25" ref-type="bibr">Peltzman et al., 2022</xref>).</p>
<p>Largely separate from these organizational approaches, we see increased public awareness of digital content at risk of loss. Recent grassroots collecting of digital materials include: work by the environmental data governance initiative to collect EPA data prior to the beginning of President Trump&#x2019;s term in office in 2017 (<xref rid="R29" ref-type="bibr">Walker et al., 2018</xref>); work by archive team to collect NSFW materials following Tumblr&#x2019;s policy change and removals in 2018 (<xref rid="R23" ref-type="bibr">Ogden, 2022</xref>); efforts of Saving Ukrainian Cultural Heritage Online (SUCHO) to preserve digital materials from Ukraine following the Russian invasion in February 2022 (<xref rid="R17" ref-type="bibr">LeBlanc et al., 2022</xref>). We observe that these forms of collecting, and preservation differ significantly from the work by practitioners within traditional memory institutions. We take inspiration from Dallas (<xref rid="R9" ref-type="bibr">2016</xref>) who advises digital curation research should,</p>
<p><italic>focus on the study of actual practices of curation in a diversity of contexts &#x2026; and should prioritize such intellectual inquiry over the imposition of models developed within the custodial fold</italic> (p. 440).</p>
<p>We therefore take up a concern with understanding collecting and curation outside of memory institutions.</p>
<p>For this initial, exploratory study, we frame this work as a <italic>&#x2018;community of practice&#x2019;</italic> to understand how groups of diverse and distributed individuals engage in digital curation. Coined by Lave and Wenger (<xref rid="R16" ref-type="bibr">1991</xref>), the concept of <italic>&#x2018;communities of practice&#x2019;</italic> (CoPs) originates in organizational studies. Cox (<xref rid="R8" ref-type="bibr">2005</xref>) addresses the concept&#x2019;s varied usage and distinctions between the early applications of CoPs, and later take-up by Brown and Duguid (<xref rid="R5" ref-type="bibr">1991</xref>, <xref rid="R6" ref-type="bibr">2001</xref>), as well as Wenger&#x2019;s (<xref rid="R30" ref-type="bibr">1998</xref>, <xref rid="R31" ref-type="bibr">2000</xref>) subsequent publications. In information studies, the concept is often deployed to describe or facilitate efforts in community building around shared practices, both within and across organizations. For digital curation in particular, CoPs have been the guiding framework for developing capacity-building workshops at the local and national level (<xref rid="R21" ref-type="bibr">Moran et al., 2019</xref>, <xref rid="R28" ref-type="bibr">Rios et al. 2020</xref>).</p>
<p>We adopt a broader view of CoPs, as advocated by Bowker and Star (<xref rid="R7" ref-type="bibr">2000</xref>) who place less emphasis on the organizational setting. This understanding of CoPs has been previously applied to study the knowledge practices of amateur and hobby communities, and particularly their distributed formation in virtual space (<xref rid="R13" ref-type="bibr">Hills, 2015</xref>). Several precedents also explore this type of community formation for learning and sharing practices specifically through reddit and its structure of subreddits (<xref rid="R4" ref-type="bibr">Britt et al., 2022</xref>; <xref rid="R12" ref-type="bibr">Haythornthwaite et al., 2018</xref>; <xref rid="R14" ref-type="bibr">Hudgins et al., 2020</xref>; <xref rid="R15" ref-type="bibr">Kwon et al. 2020</xref>). Proferes et al. (<xref rid="R26" ref-type="bibr">2021</xref>) provide a systematic overview of past research with reddit, which we also use to inform our choice in data collection, scale and methods of analysis, and ethical decisions for working with public but potentially sensitive and personal posts and comments.</p>
</sec>
<sec id="sec3">
<title>Method</title>
<p>This preliminary study deploys thematic analysis to examine how members of the r/DataHoarder subreddit conceptualize, describe, and enact their data hoarding practices. Thematic analysis allows researchers to qualitatively surface ideas generated by individuals, communities, or institutions. As noted in Braun and Clarke (<xref rid="R5" ref-type="bibr">2006</xref>), theme identification does not necessarily correlate with a quantitative measure such as prevalence in the dataset. We follow Braun and Clark&#x2019;s six phases of thematic analysis: familiarization with data; generating initial codes; searching for themes; reviewing themes; defining themes; and reporting findings (<xref rid="R5" ref-type="bibr">Braun &#x0026; Clarke, 2006</xref>, p. 87-93). We employ thematic analysis alongside theoretical commitments from CoPs research, whose concepts serve as guide points for framing themes deductively, while still offering space for inductive themes to emerge.</p>
<p>Reddit has been studied using thematic analysis within scholarly fields ranging from gender and sexuality studies to addiction medicine (<xref rid="R26" ref-type="bibr">Proferes et al., 2021</xref>; <xref rid="R11" ref-type="bibr">Graves et al., 2022</xref>; Graham &#x0026; Rodriguez, 2021). When applying thematic analysis towards Reddit as a site with multiple CoPs, we note past work that addresses the complex, often socially contentious practices this platform affords users, especially as a means to examine topics which they might face criticism or institutionalized intervention (<xref rid="R18" ref-type="bibr">Lundeen et al., 2024</xref>; <xref rid="R19" ref-type="bibr">Maxwell et al., 2020</xref>).</p>
<sec id="sec3_1">
<title>Data collection</title>
<p>We began data collection by gathering the subreddit&#x2019;s most popular posts throughout its history (as of August 2024). A cutoff point of 2000 votes (tallying upvotes and downvotes) was chosen for this initial analysis, resulting in 170 total posts included in the analysis. Posts include a range of text-based original posts, cross-posting of links to outside news articles, photographs, and memes. The links to these 170 posts were documented within a shared spreadsheet for purposes of analysis. Post titles, authorship (reddit username) and select excerpted comments were also collected for select posts but will not be shared through direct quotes or reproduced in the analysis here. While this information is available on the public web, we follow guidance from Proferes et al. (<xref rid="R26" ref-type="bibr">2021</xref>) to respect the privacy of social media users, especially as this initial analysis is not yet able to address the extent of sensitive and personal information shared within these posts or the potential for users&#x2019; identities to be triangulated through other posts and comments made on reddit.</p>
</sec>
<sec id="sec3_2">
<title>Data analysis</title>
<p>Following the iterative process described in Braun and Clarke (<xref rid="R5" ref-type="bibr">2006</xref>), the researchers deployed constant comparative coding to identify both emergent themes, while also working to create shared definitions for the themes which emerged. Work began with coding the top ten percent of the total posts within the dataset (n=17 posts), ranked by vote-count. For each post, we independently analysed the text, images, comments, and interactions to develop an initial set of codes that would inform the development of overarching themes. Upon identifying these codes, we then developed an initial set of themes that applied to these first 17 posts, before proceeding to validate these themes against the subsequent top posts to ensure that no other salient codes emerged. As an intermediate step in this iterative process, a set of high-level codes emerged. Sample codes included: materiality; value; and preventing loss. We then reviewed the remaining posts, each focusing primarily on identifying posts and comments aligned with one of these themes. A final round of discussion and refinement led to the identification of two key, overarching themes which we characterize and discuss in greater detail below.</p>
</sec>
</sec>
<sec id="sec4">
<title>Findings</title>
<p>Based on this initial analysis, we identify a set of key themes that define the r/DataHoarder vision of digital curation as a practice.</p>
<sec id="sec4_1">
<title>Theme (1) materiality of hoarding</title>
<p>Many of the top posts in r/DataHoarder highlight the materiality of hoarding, through imagery of storage media (e.g., photos of hard drives) as well as discussion of services and costs (e.g., fees for cloud storage or high-bandwidth ISP usage). Within this space, talking about and demonstrating the capacity for data storage becomes a central signifier of one&#x2019;s membership in the community. Practices enacted around materiality include sharing information on acquiring storage (notices of hard drives on sale), as well as sharing advice about in-home storage set-ups. We also take materiality to include the processes of acquiring data or digital objects, such as offline downloads of <italic>&#x2018;all of Wikipedia,&#x2019;</italic> and comments often signaling the creation of such a collection as a telltale sign of digital hoarding. Materiality as presented by r/DataHoarder posts often conflates data and digital objects, and ultimately emphasizes having control over digital materials through such storage.</p>
<sec id="sec4_1_1">
<title>Subtheme 1.1 fetishization and bragging</title>
<p>An additional facet of this focus on materiality emerges in the fetishization of cutting-edge hardware and technologies such as multiple petabyte hard drives. One post garnering significant comments and discussion includes a photo of the hard drives storing the data used to generate the first ever photographic image of a blackhole in 2019. Indicative of the fixation not on the black hole photograph itself, but instead of the storage devices, commenters metaphorically fetishize the hardware via suggestions of sexual pleasure in response to seeing these images. Another popular post presents the message received from a cloud hosting platform, that the user has exceeded their storage capacity, representative of a trend in which members brag about surpassing limitations on data plans. Not only are members concerned with storage capabilities, but the aspect bragging about capacities of hardware or cloud-based storage, often fetishizing the ability to store previously unimaginable amounts of information.</p>
</sec>
<sec id="sec4_1_2">
<title>Subtheme 1.2 nostalgia</title>
<p>In contrast to the fetishization of ever-newer, bigger storage, we observe how nostalgic leanings also shape the view of digital materials. One post describes a <italic>&#x2018;collapse-proof laptop,&#x2019;</italic> with comments discussing the utility of smaller, often obsolete formats and storage hardware as a mode to counteract anticipated corporate and institutional failures harkening back to a <italic>&#x2018;simpler time&#x2019;</italic> of computing&#x2019;s past. This aligns with a survivalist mentality of <italic>&#x2018;internet collapse preppers&#x2019;,</italic> (<xref rid="R27" ref-type="bibr">r/DataHoarder, 2017</xref>) and the need to create tools which function without pervasive online connections and cloud services (<xref rid="R10" ref-type="bibr">Grandhi et al., 2020</xref>). In the same post, members suggest retaining copies of video games such as Tetris and Quake II which reflect low data demand yet represent high yield entertainment. The role of software from one&#x2019;s youth and a tendency to focus on collections around retro-gaming is also indicative of this nostalgic turn, as seen in another post referencing collecting rare data tapes from Nintendo 64&#x2019;s development. The interest in materiality therefore often overlaps with this nostalgia for obsolete hardware and formats which recalled simpler ecosystems for hoarding data and digital objects.</p>
</sec>
</sec>
<sec id="sec4_2">
<title>Theme (2) call-to-action</title>
<p>The call-to-action is one of the themes we largely anticipated at the outset of this study, as it aligns with r/DataHoarder activities related to the January 6th United States Capitol. Since the subreddit&#x2019;s description focuses largely on its role as a space for <italic>&#x2018;like-minded individuals to exchange strategies, war stories, and cautionary tales of failures,&#x2019;</italic> (<xref rid="R27" ref-type="bibr">r/DataHoarder, 2017</xref>) these moments that we identify as calls-to-action are rather exceptional in relation to the day-to-day discussions on the subreddit. Importantly, these interventions provide the rare opportunity for data-hoarders to directly collaboration and take collective action. Many calls-to-action revolve around urgent collecting of materials directly at risk due to corporate decisions for removal, as well as political turmoil or threats of censorship. For instance, an additional example from our dataset calls on the r/DataHoarder community to help &#x2018;backup&#x2019; the social media and website of Hong Kong Stand News after their headquarters were raided and staff arrested by national security police in 2021.</p>
<sec id="sec4_2_1">
<title>Subtheme 2.1 vigilance</title>
<p>Related to direct calls-to-action, we observe that participants in r/DataHoarder express the need to remain vigilant, or <italic>&#x2018;stand watch&#x2019;</italic> in the face of potential future threats. This includes activities such as: sharing details on proposed changes to legislation affecting data transfer, privacy, and encryption; news of legal cases won or lost by perceived adversaries (namely large companies like Apple, Microsoft, YouTube, Yahoo, as well as Reddit itself), as well as ongoing legal issues experienced by the perceived ally, non-profit Internet Archive.</p>
</sec>
<sec id="sec4_2_2">
<title>Subtheme 2.2 vigilantism</title>
<p>Aligned with the overarching theme of call-to-action, the motivations driving many individuals <italic>to</italic> act often appear related to an ideal of vigilante justice. This surfaces in direct ways, as r/DataHoarder posters position their work as countering the aims of unethical and unjust companies. The r/DataHoarder community therefore justifies the use of hacking, piracy, and extralegal means for taking data out of the hands of these corporations, and hoarding is perceived as contributing to the public good of making information free, open, and accessible. Several examples relate to specific projects in open science and open culture, including one post linking a repository of thousands of scientific articles on COVID-19, made openly available through torrents and decentralized web links, referencing deceased hacktivist Aaron Swartz. Beyond these direct references to piracy and hacktivism, the vigilante ethos arises across the subreddit&#x2019;s discussions, for instance a post offering a &#x2018;bounty&#x2019; for recovery of a digitized episode of a prominent 1980s TV talk show.</p>
</sec>
</sec>
</sec>
<sec id="sec5">
<title>Discussion</title>
<p>The analysis of these themes reveals where the activities and capabilities of data hoarding converge with, or diverge from, digital curation within memory institutions. Compared to the DCC lifecycle, the practice of data hoarding begins with a focus on storage, rather than pre-ingest activities of appraisal and selection. Data-hoarders believe everything that you can get your hands on can and should be hoarded, and the only limit is how many TB hard drives you can buy. An interesting point of comparison is also seen in how hoarding addresses the DCC lifecycle&#x2019;s activities of <italic>&#x2018;create or receive&#x2019;</italic>; for data hoarding practice, these activities are much more active than passive, requiring considerable workarounds as data-hoarders seek out and capture materials through illicit means. Data-hoarders take a unique perspective on <italic>&#x2018;preservation actions&#x2019;</italic> compared to institutional practices concerned with migration, emulation, and fixity checks. Instead, data-hoarders preserve through torrents, seeding, and re-seeding, embracing the idea of, <italic>&#x2018;lots of copies keep stuff safe&#x2019;.</italic> From this perspective, access and preservation are much more closely aligned, or even seen as the same activity, enabled by their work outside of organizational policies and concerns with copyright.</p>
<p>We also observe that data hoarding practices are often explicitly driven by personal motivations of nostalgia and political stances. This presents an interesting contrast to curation practice in organizational settings, which can also, equally be influenced by personal interests of curators, though this may not be openly acknowledged due to a perceived need for institutional neutrality. The way data hoarding outwardly accepts the personal nature of memory work and collecting is a welcome change.</p>
<p>The key themes described above represent an initial analysis of how the r/DataHoarder community defines and constructs practices of data hoarding. We note that this preliminary work has several limitations, particularly in the selection of a dataset based on top posts. Many of the more everyday routines, as well as processes of enrollment and enculturation may not be present in our selected dataset. In future work we aim to study the <italic>&#x2018;long tail&#x2019;</italic> dataset of posts with significantly less engagement and employ ethnographic methods to generate rich descriptive observations to understand the values and ethics driving this practice. Additional posts studied being to reveal processes of identity formation through r/data-hoarder, seen in posts with sentiments such as <italic>&#x2018;this is why I hoard.&#x2019;</italic> This identity can also become all-consuming and challenging for posters who subsequently describe a need to stop hoarding or describe how hoarding affects the relationship with their significant other. The role of gender within this community, and the prevalence of pornography as a prevalent object of hoarding is another topic, we have not been able to analyze in detail in this preliminary work. Through subsequent phases of this study, we also hope to address if or how data hoarding distinguishes between data and other digital objects (like the articles, books, media, and games that are often the subject of hoarding). Ultimately, by comparing the work of data-hoarders with the work of curation in memory institutions, and what each is able to target in their collections, we aim to understand what gaps and absences remain.</p>
</sec>
<sec id="sec6">
<title>Conclusion</title>
<p>This work-in-progress highlights how r/DataHoarder offers a view of digital curation practice focused on the materiality of digital storage, and calls-to-action that raise public awareness of the potentially irrevocable loss of digital information. This presents useful counterpoints to commonly referenced and standardized practices of memory institutions by focusing on how communities conceptualize, practice, and develop their own ethical framework toward sustained access of digital materials. In future work we hope to address many questions and topics out-of-scope for this paper, such as expanding to other communities, comparing across communities, and applying both etic and emic coding to community posts.</p>
</sec>
</body>
<back>
<ack>
<title>Acknowledgements</title>
<p>The authors thank the anonymous reviewers for their comments and suggestions.</p>
</ack>
<ref-list>
<title>References</title>
<ref id="R1"><element-citation publication-type="other"><person-group person-group-type="author"><name><surname>Basu</surname><given-names>T.</given-names></name></person-group><year>2021</year><comment>January 8</comment><article-title>The scramble to archive Capitol insurrection footage before it disappears</article-title><source>MIT Technology Review</source><ext-link ext-link-type="uri" xlink:href="https://web.archive.org/web/20210108224223/https://www.technologyreview.com/2021/01/08/1015929/archive-capitol-insurrection-trump-maga-footage/">https://web.archive.org/web/20210108224223/https://www.technologyreview.com/2021/01/08/1015929/archive-capitol-insurrection-trump-maga-footage/</ext-link></element-citation></ref>
<ref id="R2"><element-citation publication-type="book"><person-group person-group-type="author"><name><surname>Bowker</surname><given-names>G. C.</given-names></name><name><surname>Star</surname><given-names>S. L.</given-names></name></person-group><year>2000</year><source>Sorting Things Out: Classification and Its Consequences</source><publisher-name>MIT Press</publisher-name></element-citation></ref>
<ref id="R3"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Braun</surname><given-names>V.</given-names></name><name><surname>Clarke</surname><given-names>V.</given-names></name></person-group><year>2006</year><article-title>Using thematic analysis in psychology</article-title><source>Qualitative Research in Psychology</source><volume>3</volume><issue>2</issue><fpage>77</fpage><lpage>101</lpage><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1191/1478088706qp063oa">https://doi.org/10.1191/1478088706qp063oa</ext-link></element-citation></ref>
<ref id="R4"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Britt</surname><given-names>B. C.</given-names></name><name><surname>Britt</surname><given-names>R. K.</given-names></name><name><surname>Hayes</surname><given-names>J. L.</given-names></name><name><surname>Oh</surname><given-names>J.</given-names></name></person-group><year>2022</year><article-title>Continuing a community of practice beyond the death of its domain: Examining the Tales of Link subreddit</article-title><source>Behaviour &#x0026; Information Technology</source><volume>41</volume><issue>1</issue><fpage>159</fpage><lpage>180</lpage><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1080/0144929X.2020.1797173">https://doi.org/10.1080/0144929X.2020.1797173</ext-link></element-citation></ref>
<ref id="R5"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Brown</surname><given-names>J. S.</given-names></name><name><surname>Duguid</surname><given-names>P.</given-names></name></person-group><year>1991</year><article-title>Organizational Learning and Communities-of-Practice: Toward a Unified View of Working, Learning, and Innovation</article-title><source>Organization Science</source><volume>2</volume><issue>1</issue><fpage>40</fpage><lpage>57</lpage></element-citation></ref>
<ref id="R6"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Brown</surname><given-names>J. S.</given-names></name><name><surname>Duguid</surname><given-names>P.</given-names></name></person-group><year>2001</year><article-title>Knowledge and Organization: A Social-Practice Perspective</article-title><source>Organization Science</source><volume>12</volume><issue>2</issue><fpage>198</fpage><lpage>213</lpage><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1287/orsc.12.2.198.10116">https://doi.org/10.1287/orsc.12.2.198.10116</ext-link></element-citation></ref>
<ref id="R7"><element-citation publication-type="other"><person-group person-group-type="author"><name><surname>Chapman</surname><given-names>E. M.</given-names></name></person-group><year>2023</year><article-title>ARCHIVING THE INSURRECTION: THE CASE OF R/DATAHOARDER. AoIR Selected Papers of Internet Research</article-title><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5210/spir.v2022i0.12987">https://doi.org/10.5210/spir.v2022i0.12987</ext-link></element-citation></ref>
<ref id="R8"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Cox</surname><given-names>A.</given-names></name></person-group><year>2005</year><article-title>What are communities of practice? A comparative review of four seminal works</article-title><source>Journal of Information Science</source><volume>31</volume><issue>6</issue><fpage>527</fpage><lpage>540</lpage><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1177/0165551505057016">https://doi.org/10.1177/0165551505057016</ext-link></element-citation></ref>
<ref id="R9"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Dallas</surname><given-names>C.</given-names></name></person-group><year>2016</year><article-title>Digital curation beyond the &#x201C;wild frontier&#x201D;: A pragmatic approach</article-title><source>Archival Science</source><volume>16</volume><issue>4</issue><fpage>421</fpage><lpage>457</lpage><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s10502-015-9252-6">https://doi.org/10.1007/s10502-015-9252-6</ext-link></element-citation></ref>
<ref id="R10"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Grandhi</surname><given-names>S. A.</given-names></name><name><surname>Plotnick</surname><given-names>L.</given-names></name><name><surname>Hiltz</surname><given-names>S. R.</given-names></name></person-group><year>2020</year><article-title>An Internet-less World? Expected Impacts of a Complete Internet Outage with Implications for Preparedness and Design</article-title><source>Proc. ACM Hum.- Comput. Interact.</source><volume>4</volume><issue>GROUP</issue><comment>03:1-03:24</comment><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1145/3375183">https://doi.org/10.1145/3375183</ext-link></element-citation></ref>
<ref id="R11"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Graves</surname><given-names>R. L.</given-names></name><name><surname>Perrone</surname><given-names>J.</given-names></name><name><surname>Al-Garadi</surname><given-names>M. A.</given-names></name><name><surname>Yang</surname><given-names>Y.-C.</given-names></name><name><surname>Love</surname><given-names>J.</given-names></name><name><surname>O&#x2019;Connor</surname><given-names>K.</given-names></name><name><surname>Gonzalez-Hernandez</surname><given-names>G.</given-names></name><name><surname>Sarker</surname><given-names>A.</given-names></name></person-group><year>2022</year><article-title>Thematic Analysis of Reddit Content About Buprenorphine&#x2013;naloxone Using Manual Annotation and Natural Language Processing Techniques</article-title><source>Journal of Addiction Medicine</source><volume>16</volume><issue>4</issue><fpage>454</fpage><lpage>460</lpage><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1097/ADM.0000000000000940">https://doi.org/10.1097/ADM.0000000000000940</ext-link></element-citation></ref>
<ref id="R12"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Haythornthwaite</surname><given-names>C.</given-names></name><name><surname>Kumar</surname><given-names>P.</given-names></name><name><surname>Gruzd</surname><given-names>A.</given-names></name><name><surname>Gilbert</surname><given-names>S.</given-names></name><name><surname>Esteve del Valle</surname><given-names>M.</given-names></name><name><surname>Paulin</surname><given-names>D.</given-names></name></person-group><year>2018</year><article-title>Learning in the wild: Coding for learning and practice on Reddit</article-title><source>Learning, Media, and Technology</source><volume>43</volume><issue>3</issue><fpage>219</fpage><lpage>235</lpage><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1080/17439884.2018.1498356">https://doi.org/10.1080/17439884.2018.1498356</ext-link></element-citation></ref>
<ref id="R13"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hills</surname><given-names>M.</given-names></name></person-group><year>2015</year><article-title>The expertise of digital fandom as a &#x2018;community of practice&#x2019;: Exploring the narrative universe of Doctor Who</article-title><source>Convergence</source><volume>21</volume><issue>3</issue><fpage>360</fpage><lpage>374</lpage><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1177/1354856515579844">https://doi.org/10.1177/1354856515579844</ext-link></element-citation></ref>
<ref id="R14"><element-citation publication-type="other"><person-group person-group-type="author"><name><surname>Hudgins</surname><given-names>W.</given-names></name><name><surname>Lynch</surname><given-names>M.</given-names></name><name><surname>Schmal</surname><given-names>A.</given-names></name><name><surname>Sikka</surname><given-names>H.</given-names></name><name><surname>Swenson</surname><given-names>M.</given-names></name><name><surname>Joyner</surname><given-names>D. A.</given-names></name></person-group><year>2020</year><article-title>Informal Learning Communities: The Other Massive Open Online &#x2018;C.&#x2019;</article-title><source>Proceedings of the Seventh ACM Conference on Learning @ Scale</source><fpage>91</fpage><lpage>101</lpage><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1145/3386527.3405926">https://doi.org/10.1145/3386527.3405926</ext-link></element-citation></ref>
<ref id="R15"><element-citation publication-type="other"><person-group person-group-type="author"><name><surname>Kwon</surname><given-names>K. H.</given-names></name><name><surname>Yu</surname><given-names>W.</given-names></name><name><surname>Kilar</surname><given-names>S.</given-names></name><name><surname>Shao</surname><given-names>C.</given-names></name><name><surname>Broussard</surname><given-names>K.</given-names></name><name><surname>Lutes</surname><given-names>T.</given-names></name></person-group><year>2020</year><article-title>Knowledge Sharing Network in a Community of Illicit Practice: A Cybermarket Subreddit Case</article-title><source>Proceedings of the 53rd Hawaii International Conference on Systems Sciences</source><fpage>2731</fpage><lpage>2740</lpage><ext-link ext-link-type="uri" xlink:href="http://hdl.handle.net/10125/64076">http://hdl.handle.net/10125/64076</ext-link></element-citation></ref>
<ref id="R16"><element-citation publication-type="book"><person-group person-group-type="author"><name><surname>Lave</surname><given-names>J.</given-names></name><name><surname>Wenger</surname><given-names>E.</given-names></name></person-group><year>1991</year><source>Situated Learning: Legitimate Peripheral Participation (1st ed.)</source><publisher-name>Cambridge University Press</publisher-name><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1017/CBO9780511815355">https://doi.org/10.1017/CBO9780511815355</ext-link></element-citation></ref>
<ref id="R17"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>LeBlanc</surname><given-names>Z.</given-names></name><name><surname>Janco</surname><given-names>A.</given-names></name><name><surname>Wermer-Colan</surname><given-names>A.</given-names></name><name><surname>Dombrowski</surname><given-names>Q.</given-names></name><name><surname>Kijas</surname><given-names>A.</given-names></name><name><surname>Majstorovic</surname><given-names>S.</given-names></name><name><surname>Strong</surname><given-names>D.</given-names></name><name><surname>Peaslee</surname><given-names>E.</given-names></name></person-group><year>2022</year><article-title>A Conversation with the Organizers of Saving Ukrainian Cultural Heritage Online (SUCHO)</article-title><source>Journal of Library Outreach and Engagement</source><volume>2</volume><issue>1</issue><comment>Article 1</comment><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.21900/j.jloe.v2i1.969">https://doi.org/10.21900/j.jloe.v2i1.969</ext-link></element-citation></ref>
<ref id="R18"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lundeen</surname><given-names>L. A.</given-names></name><name><surname>McCall</surname><given-names>J. R.</given-names></name><name><surname>Bradshaw</surname><given-names>A. S.</given-names></name><name><surname>LeBlanc</surname><given-names>E. L.</given-names></name><name><surname>Humphrey</surname><given-names>L.</given-names></name></person-group><year>2024</year><article-title>Digital Catharsis or Harmful Exposure? A Thematic Analysis of Self-Directed Violence Reddit Posts</article-title><source>Social Media + Society</source><volume>10</volume><issue>3</issue><fpage>20563051241263562</fpage><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1177/20563051241263562">https://doi.org/10.1177/20563051241263562</ext-link></element-citation></ref>
<ref id="R19"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Maxwell</surname><given-names>D.</given-names></name><name><surname>Robinson</surname><given-names>S. R.</given-names></name><name><surname>Williams</surname><given-names>J. R.</given-names></name><name><surname>Keaton</surname><given-names>C.</given-names></name></person-group><year>2020</year><article-title>&#x201C;A Short Story of a Lonely Guy&#x201D;: A Qualitative Thematic Analysis of Involuntary Celibacy Using Reddit</article-title><source>Sexuality &#x0026; Culture</source><volume>24</volume><issue>6</issue><fpage>1852</fpage><lpage>1874</lpage><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s12119-020-09724-6">https://doi.org/10.1007/s12119-020-09724-6</ext-link></element-citation></ref>
<ref id="R20"><element-citation publication-type="book"><person-group person-group-type="author"><name><surname>McMeekin</surname><given-names>S.</given-names></name><name><surname>Currie</surname><given-names>A.</given-names></name></person-group><year>2022</year><source>DPC Competency Audit Toolkit Guide (1st ed.)</source><publisher-name>Digital Preservation Coalition</publisher-name><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.7207/dpccat22-01">https://doi.org/10.7207/dpccat22-01</ext-link></element-citation></ref>
<ref id="R21"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Moran</surname><given-names>J.</given-names></name><name><surname>Feltham</surname><given-names>F.</given-names></name><name><surname>Love</surname><given-names>V.</given-names></name></person-group><year>2019</year><article-title>Building an Aotearoa New Zealand-wide Digital Curation Community of Practice</article-title><source>International Journal of Digital Curation</source><volume>14</volume><issue>1</issue><comment>Article 1</comment><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.2218/ijdc.v14i1.638">https://doi.org/10.2218/ijdc.v14i1.638</ext-link></element-citation></ref>
<ref id="R22"><element-citation publication-type="other"><person-group person-group-type="author"><collab>National Digital Stewardship Alliance (NDSA)</collab></person-group><year>2019</year><article-title>2019 Levels of Digital Preservation Matrix</article-title><ext-link ext-link-type="uri" xlink:href="https://osf.io/2mkwx/">https://osf.io/2mkwx/</ext-link></element-citation></ref>
<ref id="R23"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ogden</surname><given-names>J.</given-names></name></person-group><year>2022</year><article-title>&#x201C;Everything on the internet can be saved&#x201D;: Archive Team, Tumblr, and the cultural significance of web archiving</article-title><source>Internet Histories</source><volume>6</volume><issue>1&#x2013;2</issue><fpage>1</fpage><lpage>20</lpage><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1080/24701475.2021.1985835">https://doi.org/10.1080/24701475.2021.1985835</ext-link></element-citation></ref>
<ref id="R24"><element-citation publication-type="book"><person-group person-group-type="author"><name><surname>Panek</surname><given-names>E. T.</given-names></name></person-group><year>2022</year><source>Understanding Reddit</source><publisher-name>Routledge</publisher-name></element-citation></ref>
<ref id="R25"><element-citation publication-type="other"><person-group person-group-type="author"><name><surname>Peltzman</surname><given-names>S.</given-names></name><name><surname>Dietz</surname><given-names>B.</given-names></name><name><surname>Butler</surname><given-names>D.</given-names></name><name><surname>Walker</surname><given-names>P.</given-names></name><name><surname>Farrell</surname><given-names>J.</given-names></name><name><surname>Arroyo-Ramirez</surname><given-names>E.</given-names></name><name><surname>Macquarie</surname><given-names>C.</given-names></name><name><surname>Bolding</surname><given-names>K.</given-names></name><name><surname>Helms</surname><given-names>A.</given-names></name><name><surname>Venlet</surname><given-names>J.</given-names></name><name><surname>Cobourn</surname><given-names>A.</given-names></name><name><surname>Watson</surname><given-names>C.</given-names></name><name><surname>Taylor</surname><given-names>A.</given-names></name><name><surname>Henke</surname><given-names>K.</given-names></name></person-group><year>2022</year><article-title>Levels of Born- Digital Access</article-title><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.17605/OSF.IO/R5F78">https://doi.org/10.17605/OSF.IO/R5F78</ext-link></element-citation></ref>
<ref id="R26"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Proferes</surname><given-names>N.</given-names></name><name><surname>Jones</surname><given-names>N.</given-names></name><name><surname>Gilbert</surname><given-names>S.</given-names></name><name><surname>Fiesler</surname><given-names>C.</given-names></name><name><surname>Zimmer</surname><given-names>M.</given-names></name></person-group><year>2021</year><article-title>Studying Reddit: A Systematic Overview of Disciplines, Approaches, Methods, and Ethics</article-title><source>Social Media + Society</source><volume>7</volume><issue>2</issue><fpage>20563051211019004</fpage><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1177/20563051211019004">https://doi.org/10.1177/20563051211019004</ext-link></element-citation></ref>
<ref id="R27"><element-citation publication-type="other"><person-group person-group-type="author"><collab>r/DataHoarder</collab></person-group><year>2017</year><comment>July 30</comment><article-title>A quick Datahoarder FAQ [Reddit Post]</article-title><ext-link ext-link-type="uri" xlink:href="https://web.archive.org/web/20171211044900/www.reddit.com/r/DataHoarder/comments/6qf716/a_quick_datahoarder_faq/">https://web.archive.org/web/20171211044900/www.reddit.com/r/DataHoarder/comments/6qf716/a_quick_datahoarder_faq/</ext-link></element-citation></ref>
<ref id="R28"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Rios</surname><given-names>F.</given-names></name><name><surname>Lassere</surname><given-names>M.</given-names></name><name><surname>Ruggill</surname><given-names>J. E.</given-names></name><name><surname>McAllister</surname><given-names>K. S.</given-names></name></person-group><year>2020</year><article-title>Sustaining Software Preservation Efforts Through Use and Communities of Practice</article-title><source>International Journal of Digital Curation</source><volume>15</volume><issue>1</issue><comment>Article 1</comment><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.2218/ijdc.v15i1.696">https://doi.org/10.2218/ijdc.v15i1.696</ext-link></element-citation></ref>
<ref id="R29"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Walker</surname><given-names>D.</given-names></name><name><surname>Nost</surname><given-names>E.</given-names></name><name><surname>Lemelin</surname><given-names>A.</given-names></name><name><surname>Lave</surname><given-names>R.</given-names></name><name><surname>Dillon</surname><given-names>L.</given-names></name></person-group><year>2018</year><article-title>Practicing environmental data justice: From DataRescue to Data Together</article-title><source>Geo: Geography and Environment</source><volume>5</volume><issue>2</issue><fpage>e00061</fpage><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1002/geo2.61">https://doi.org/10.1002/geo2.61</ext-link></element-citation></ref>
<ref id="R30"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wenger</surname><given-names>E.</given-names></name></person-group><year>1998</year><article-title>Communities of Practice: Learning as a Social System</article-title><source>Systems Thinker</source><volume>9</volume><issue>5</issue><fpage>2</fpage><lpage>3</lpage></element-citation></ref>
<ref id="R31"><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wenger</surname><given-names>E.</given-names></name></person-group><year>2000</year><article-title>Communities of Practice and Social Learning Systems</article-title><source>Organization</source><volume>7</volume><issue>2</issue><fpage>225</fpage><lpage>246</lpage><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1177/135050840072002">https://doi.org/10.1177/135050840072002</ext-link></element-citation></ref>
</ref-list>
</back>
</article>