From e696a5e09b043f1d85c8b0b74af02cf9d73146ed Mon Sep 17 00:00:00 2001 From: Nick Ruest Date: Tue, 25 Feb 2020 10:27:36 -0500 Subject: [PATCH] Updates for Archives Unleashed projects. (#78) - Deprecate AUK notebooks - Add AU Notebooks - Add twut - Update the description for aut - Update documentation link --- README.md | 7 ++++--- deprecated.md | 1 + 2 files changed, 5 insertions(+), 3 deletions(-) diff --git a/README.md b/README.md index 22ec158..ab57b22 100644 --- a/README.md +++ b/README.md @@ -30,7 +30,7 @@ Web archiving, an effort to preserve the Web for future generations. * The [offical ISO 28500 WARC specification homepage](http://bibnum.bnf.fr/WARC/). * More advanced material: * [Awesome Memento](https://github.com/machawk1/awesome-memento) - * [Archives Unleashed Toolkit documentation](http://archivesunleashed.org/aut/) + * [Archives Unleashed Toolkit documentation](https://github.com/archivesunleashed/aut-docs) * [Heritrix Walkthrough](https://github.com/web-archive-group/heritrix-walkthrough) (In development) * [The WARC Ecosystem](http://www.archiveteam.org/index.php?title=The_WARC_Ecosystem) * [The Web Crawl section of COPTR](http://coptr.digipres.org/Category:Web_Crawl) @@ -137,8 +137,9 @@ This list of tools and software is intended to briefly describe some of the most * [ArchiveSpark](https://github.com/helgeho/ArchiveSpark) - An Apache Spark framework (not only) for Web Archives that enables easy data processing, extraction as well as derivation. (Stable) * [Archives Unleashed Cloud](https://cloud.archivesunleashed.org) - Archives Unleashed Cloud (AUK) is an web interface for analysing web archives. Currently, it can sync with Archive-It collections and extract hyperlink networks, full text, and other information from your collections. (Stable) -* [Archives Unleashed Notebooks](https://github.com/archivesunleashed/auk-notebooks) - Jupyter notebooks to assist in creating additional analysis and visualizations of Archives Unleashed Cloud derivatives. -* [Archives Unleashed Toolkit](https://github.com/archivesunleashed/aut) - Archives Unleashed Toolkit (AUT) is an open-source platform for analyzing web archives. (Stable) +* [Archives Unleashed Notebooks](https://github.com/archivesunleashed/notebooks) - Notebooks for working with web archives with the Archives Unleashed Toolkit, and derivatives generated by the Archives Unleashed Toolkit. (Stable) +* [Archives Unleashed Toolkit](https://github.com/archivesunleashed/aut) - Archives Unleashed Toolkit (AUT) is an open-source platform for analyzing web archives with Apache Spark. (Stable) +* [Tweet Archvies Unleashed Toolkit](https://github.com/archivesunleashed/twut) - An open-source toolkit for analyzing line-oriented JSON Twitter archives with Apache Spark. (In Development) ### Quality Assurance diff --git a/deprecated.md b/deprecated.md index 9f2576c..dacf1f9 100644 --- a/deprecated.md +++ b/deprecated.md @@ -1,5 +1,6 @@ # Deprecated/Abandoned Items * [ArchiveFacebook](https://addons.mozilla.org/en-US/firefox/addon/archivefacebook/) - (Abandoned) - A [Mozilla Firefox](https://www.mozilla.org/en-US/firefox/new/) add-on for individuals to archive their Facebook accounts. +* [Archives Unleashed Notebooks](https://github.com/archivesunleashed/auk-notebooks) - Jupyter notebooks to assist in creating additional analysis and visualizations of Archives Unleashed Cloud derivatives. * [pywb Wayback Web Recorder (Archiver)](https://github.com/ikreymer/pywb-webrecorder) - (Sunsetted) - A bare-bones example of how to create a simple web recording and replay system. * [Warrick](https://github.com/oduwsdl/warrick) - (Unmaintained) - An open source tool for reconstructing lost websites from web archives using [Memento](http://mementoweb.org/).