Commit Graph

62 Commits

Author SHA1 Message Date
Julien Nioche
82ac314103 added StormCrawler (#66) 2019-06-25 10:30:45 -04:00
Lauren Ko
34b2b5207f Add py-wasapi-client (#65) 2019-03-29 17:40:33 -04:00
Nick Ruest
905ed00ab9 Add auk-notebooks. (#64) 2019-03-08 14:16:36 +00:00
Andy Jackson
b28c9c726d Adding a QA section & more detailed TOC (#63) 2019-03-07 10:22:50 -05:00
Ross Spencer
e66c872e6f Add wikiteam to utilities (#62) 2019-03-02 13:20:07 -05:00
hook
eff8911af8 Updated grab-site repository URL (#61) 2019-03-02 13:19:19 -05:00
Ryan Patterson
78b14e6aed Add link to Chronicler (#60)
Add information about Chronicler to the "acquisition" section.
2019-01-14 17:57:45 -05:00
Nick Sweeting
7930a983be Rename Bookmark Archiver to new name ArchiveBox (#59) 2018-12-22 00:44:01 +00:00
Andy Jackson
826dae2972 Add links to WARC specification sites (#58) 2018-11-30 12:25:49 -05:00
Laura Wrubel
e1811a6cd5 Adding Social Feed Manager (#57) 2018-11-30 08:31:09 -05:00
Peter Krantz
285006a76d Added Warcworker (#56)
* Added Warcworker
2018-11-26 08:10:21 -05:00
Ian Milligan
4976c2b592 Adds Archives Unleashed Cloud and updates AUT (#55) 2018-11-16 19:22:49 -05:00
raffaele messuti
e2cde6b83f new tools: crawl and wasp (#54) 2018-11-12 15:20:39 -05:00
Lars
494912c939 Add crocoite (#53) 2018-11-10 10:46:24 -05:00
Andy Jackson
a89a159a01 Moved webarchive-discovery associated tools to be together (#50)
As suggested in https://github.com/iipc/awesome-web-archiving/pull/47#issue-195740023 this moves the `webarchive-discovery`-related tools under a `webarchive-discovery` section.
2018-10-16 07:48:46 -04:00
Andy Jackson
2d394f9a49 Create section for guidance for web publishers (#49)
* Some clean up and added Slack.

* Separate the basic and mroe advanced stuff, and add the intro video in.

* Added some new links and detail responding to #22.

* Add specific section for web publishers.
2018-10-16 12:27:37 +01:00
Toke Eskildsen
9e6d936a82 Added SolrWayback (#47)
* Added SolrWayback for both replay and discovery

* Removed SolrWayback from playback as it was confusing to list it under two different headings on the same page
2018-10-16 12:27:01 +01:00
Alex Osborne
c15b3c97e8 Heritrix wiki has moved to Github 2018-07-06 09:06:39 +09:00
Nick Sweeting
42e97d36ac Add Bookmark-Archiver (#46) 2018-06-24 16:00:21 -04:00
Alex Osborne
878d982775 Add OutbackCDX (#45) 2018-05-23 10:27:56 +01:00
Mat Kelly
9fcff939e3 Add ArchiveTools per #42 (#44)
Ping @ruebot
2018-05-15 12:18:27 -04:00
Ian Milligan
c6f6b4656d Updating old documentation links to new ones (#43) 2018-04-06 09:16:30 -04:00
IAMONSYS GmbH
063e0f2f35 Adding securitytrails archive (#41) 2018-03-14 08:51:20 -04:00
Jeremy Cahill
7964fca0e7 Move ArchiveFacebook to Deprecated (#40)
Project's FF addons page is disabled. Source appears to have been ported from Google Code in anticipation of updates that didn't materialize: https://groups.google.com/forum/?hl=en#!topic/archivefacebook/_m8KeOTnBng
2017-12-07 20:58:47 -05:00
Nick Ruest
03aa9703fd Update warclight (#35)
* Update warclight

* s/warcbase/aut/
2017-09-17 16:26:29 +01:00
Ashley
4013e4b8e2 rm duplicate link to awesome-momento (#39) 2017-09-16 14:50:09 -04:00
Ross Spencer
06b47c0f23 Added HTTPreserve Workbench (#37)
* Added HTTPreserve Workbench.

* Added language to HTTPreserve Workbench.
2017-09-03 19:18:11 -04:00
Ross Spencer
7e9671f411 Added HTTPreserve tikalinkextract. (#36) 2017-09-03 09:41:30 -04:00
Ross Spencer
17a41aca7e Added httpreserve.info (#38) 2017-09-03 09:40:43 -04:00
Patrick Connolly
11a60a2301 Added Archivers slack team. (#34) 2017-08-13 18:48:50 -04:00
Ian Milligan
5cf01e48df New sign-up process for Archives Unleashed slack (#33)
Changes from e-mail @ianmilligan1 to the @ruebot-created form.
2017-08-11 07:55:50 -04:00
Helge Holzmann
80deff9b4b Add Tempas v1 and v2 (#32)
* Add Tempas v1

* Add Tempas v2
2017-07-26 08:15:54 -04:00
John Berlin
63a410126d Add node-cdxj to the list (#31) 2017-07-24 23:02:15 -04:00
John Berlin
2d46142baa Updated node-warcs entry in the list to reflect http://ws-dl.blogspot.com/2017/07/2017-07-24-replacing-heritrix-with.html and WAILs + Squidwarcs usage of this library (#30) 2017-07-24 22:25:18 -04:00
John Berlin
3505b572dd Add Squidwarc to the list (#29) 2017-07-24 22:24:44 -04:00
Mohamed Aturban
d7fd3167a2 Add archivenow to the list (#28) 2017-07-10 13:05:18 +01:00
Mat Kelly
31389f46b9 Updates to PR#24 by @kant as recommended by @ruebot (#27)
* Minor fixes

* Changes per @ruebot in PR#14
2017-07-08 09:54:17 -04:00
Mat Kelly
c5b04a33e8 Add InterPlanetary Wayback (#26)
I deliberated under which category this should fit but replay seems most appropriate.
2017-07-07 21:55:29 +01:00
Mat Kelly
5dc48f29b8 Fix spelling (#25) 2017-07-07 15:21:25 +01:00
Andy Jackson
d820480d6e Add some links to other resources and clarifications (#23)
* Some clean up and added Slack.

* Separate the basic and mroe advanced stuff, and add the intro video in.

* Added some new links and detail responding to #22.
2017-06-26 16:38:34 -04:00
Mat Kelly
15b429d288 Update README.md (#21) 2017-06-23 12:57:54 -04:00
nruest
38b2694985 Test for http://netpreserve.org/web-archiving/tools-and-software embed 2017-06-22 16:08:03 -04:00
Mat Kelly
2a8288928e Rm superfluous paren (#20) 2017-06-21 17:17:45 -04:00
raffaele messuti
107fb052a3 add warcio, warctools, har2warc, node-warc, go webarchive (#19)
* warcat: still in utilities

* add webarchive-indexing

* add The Archive Browser

* add warcio, warctools, har2warc, node-warc, go webarchive
2017-06-21 16:01:56 -04:00
Nick Ruest
babfbac355 Add Heritrix Walkthrough (#18) 2017-06-20 10:05:35 +01:00
Mat Kelly
1634b60c96 Add "The Unarchiver" app. (#17)
A free variant of the already included "The Archive Browser" limited to the extraction features.
2017-06-19 11:28:32 -04:00
Kristinn Sigurðsson
d52d478000 Update README.md
Fixed incorrect alphabetical ordering of item.
2017-06-19 10:55:54 +00:00
Steffen
c370e303dc added html2warc (#16)
added html2warc, a simple script to convert offline data into a single warc file
2017-06-18 10:26:35 -04:00
raffaele messuti
4e413e2342 add webarchive-index and "the archive browser", remove warccat duplicate (#15)
* warcat: still in utilities

* add webarchive-indexing

* add The Archive Browser
2017-06-17 16:51:37 -04:00
Nick Ruest
c3658d76da Search & discovery (#14) 2017-06-17 13:08:44 +01:00