Recent Releases of eprints2archives
eprints2archives - Release 1.3.5
Changes in this version:
* Fix bug in definition of Service class in services/base.py.
* Update some package versions in requirements.txt.
* Pin more versions of packages in requirements.txt.
* Change name of runnable program to have dash between the name eprints2archives and the version number.
* Updated copyright year in file headers and other places.
- Python
Published by mhucka almost 3 years ago
eprints2archives - Release 1.3.4
Changes in this version:
* Fix handling of occasionally unexpected timemap result from InternetArchive.
* Update versions of some dependencies in requirements.txt.
* Add of CITATION.cff file.
- Python
Published by mhucka over 3 years ago
eprints2archives - Release 1.3.3
Changes in this release (relative to previous release)
- Define
console_scriptsfor setuptools to produce a better wrapper script. - Update all requirement versions to latest versions of packages. This was not done for over a year and it showed...
- Add requirement for a certain package imported by another requirement but that does not, for some reason, get imported properly when a clean virtual environment is created in the process of making pyz apps.
New: self-contained binary distributions
This release includes standalone, runnable binaries for macOS and Linux. These experimental binaries were built using shiv. They only need a copy of Python 3.6, 3.7, 3.8 or 3.9 on your system to run – they don't require installation of additional Python packages using (e.g.) pip. They should work on macOS 10.13–11.1, Ubuntu 18+, CentOS 7.5, and probably others. To use them:
Download the zip file for either the Python 3.6, 3.7, 3.8 or the 3.9 version, depending on the version of Python you have installed on your computer. (Use
python3 --versionto find out the version you have.)Unzip the resulting file (if your web browser did not automatically unzip it).
Move the file
eprints2archivesto a location where your terminal shell looks for programs. A popular location is/usr/local/bin.
Thereafter, you should be able to type eprints2archives in your shell and it should run. (If it does not, please report the problem using the issue tracker.) Note that the first time you run it, eprints2archives will take longer than usual to start because it does a one-time configuration step, but on subsequent runs, startup times should be shorter.
- Python
Published by mhucka over 4 years ago
eprints2archives - Release 1.3.2
Changes in this release
- Fix a mixhandled exception when a server returns a code 500.
- Improve catching interrupts on Windows.
- Use CommonPy network utilities, file utilities, and data utilities instead of internal copies.
- Use Bun user interface code instead of internal copy. Also change some colors of messages printed by
eprints2archives. - Update internal imports and some requirements.
- Update copyright year.
New: self-contained binary distributions
This release includes standalone, runnable binaries for macOS. These experimental binaries were built using shiv. They only need a copy of Python 3.7, 3.8 or 3.9 on your system to run – they don't require installation of additional Python packages using (e.g.) pip. They should work on macOS 10.13–11.1. To use them:
Download the zip file for either the Python 3.7, 3.8 or the 3.9 version, depending on the version of Python you have installed on your computer
Unzip the resulting file (if your web browser did not automatically unzip it)
Move the file
eprints2archivesto a location where your terminal shell looks for programs. A popular location is/usr/local/bin.
Thereafter, you should be able to type eprints2archives in your shell and it should run. (If it does not, please report the problem using the issue tracker.) Note that the first time you run it, eprints2archives will take longer than usual to start because it does a one-time configuration step, but on subsequent runs, startup times should be shorter.
- Python
Published by mhucka over 4 years ago
eprints2archives - Release 1.3.1
This version has no functional or user-interface changes. It brings only some minor documentation updates to README.md and the following internal changes:
- Use Sidetrack instead of internal
debug.pyversion of the same. - Internally, use different approach to recording version number and other metadata.
- Use updated release procedure codified in
Makefile.
The file CHANGES contains a more complete change log that includes information about previous releases.
- Python
Published by mhucka over 5 years ago
eprints2archives - Version 1.3.0
The following are the main changes in this release:
- Check that URLs obtained from EPrints records appear to be valid URLs, before trying to send them to web archives. (This is mostly to catch bad values in the
official_urlrecord field.) - Be more careful about which
/view/X/N.htmlpages are sent. - Do a better job with HTTP code 400 from Internet Archive.
- Add some more debug log statements.
- Some more internal network code refactoring.
The file CHANGES contains a more complete change log that includes information about previous releases.
- Python
Published by mhucka over 5 years ago
eprints2archives - Version 1.2.2 – Minor bug fix
Changes in this release:
- Retry network operations one time if get HTTP code 400.
- Internal network code refactoring.
The file CHANGES contains a more complete change log that includes information about previous releases.
- Python
Published by mhucka over 5 years ago
eprints2archives - Version 1.2.1 – Minor bug fixes
Changes in this release:
- Add missing
requirements.txtdependency forh2package. - Make parsing of malformed id ranges slightly more robust.
- Fix incorrect pluralization of an info message.
- Remove accidentally left-in invocation of
pdbupon errors even if debugging not enabled. - Edit the README.md file slightly.
The file CHANGES contains a more complete change log that includes information about previous releases.
- Python
Published by mhucka over 5 years ago
eprints2archives - Version 1.2.0 – Coverage improvements and use of HTTP/2
Changes in this release:
- In addition to the record pages,
eprints2archivesnow also harvests general URLs from the server, including the top-level URL and/viewand 2 levels of pages underneath it. However, if a subset of records is requested, only gets those particular/view/X/N.htmlpages rather than all pages under/view/X/. - Internal changes allow it to use protocol HTTP/2, which was necessary to communicate with Archive.Today (because it appears to have stopped accepting save requests unless HTTP2 is used).
- Now tries to add
https://orhttp://if the user forgets to provide it, and also removes/eprintand adds/restif needed. This makes it possible for the user to just provide a host name andeprints2archiveswill figure out the rest. - Minor improvements to some of the run-time status messages.
- More progress bars!
- Improvements to debug logging.
- Improvements to README.md.
- Internal code refactoring.
- Python
Published by mhucka over 5 years ago
eprints2archives - Version 1.1.0 – Better coverage of /view pages
The main change in this release is the addition of some simple web-scraping code to extract the pages under /view (as well as the subpages of those pages) from an EPrints site. The approach looks for some common HTML/CSS constructs in EPrints pages and should work for normal EPrints 3.x sites.
- Python
Published by mhucka over 5 years ago
eprints2archives - Version 1.0.0 – First full release
eprints2archives is a program that can obtain records from an EPrints server and send them to public web archiving services such as the Internet Archive and other sites. The program is written in Python 3 and works over a network using an EPrints server's REST API. It provides a command-line interface and can be used on macOS, Linux, and Windows systems. You can run eprints2archives with the --help option to get more information and help with usage.
This release supports Internet Archive and Archive Today. More web archiving services will hopefully be added in the future.
- Python
Published by mhucka over 5 years ago