Just noticed this today - seems all the archiving activity has been noticed by NCBI / NLM staff. Thankfully most of SRA (the Sequence Read Archive) and other genomic data is also mirrored in Europe.

  • @pansapiens@lemmy.sdf.orgOP
    link
    fedilink
    English
    132 months ago

    From watching the ArchiveTeam’s Warrior URLs as they stream past, it looks like PubMed Central manuscripts are being archived, which is a good thing.

  • @taiidan@slrpnk.net
    link
    fedilink
    22 months ago

    That’s a lot of data to be archiving! What’s the archiving action responsible for this, or what group? I work with SRA and GEO daily for work, so this is interesting to see on lemmy.

    • @pansapiens@lemmy.sdf.orgOP
      link
      fedilink
      English
      32 months ago

      It looks like ArchiveTeam’s Warrior was mostly capturing PubMedCentral (PMC) articles. As far as I know, SRA and GEO aren’t being backed up by ArchiveTeam (that is a lot of data), but since SRA is largely also mirrored by ENA, it wouldn’t seem a priority.

      • @taiidan@slrpnk.net
        link
        fedilink
        12 months ago

        Didn’t know about ENA mirroring. Thanks! I’m tickled by the idea that all the paywalled journals are not backed up. If we ever have a planet wide catastrophe, we’ll have to rebuild using the open articles only!

    • @pansapiens@lemmy.sdf.orgOP
      link
      fedilink
      English
      12 months ago

      It looks like ArchiveTeam’s Warrior was mostly capturing PubMedCentral (PMC) articles. As far as I know, SRA and GEO aren’t being backed up by ArchiveTeam (that is a lot of data), but since SRA is largely also mirrored by ENA, it wouldn’t seem a priority.