Hi,
I just wanted to let people know that the openaleph command line interface and Python library exist. It’s the next iteration of the original alephclient, with some quality-of-life improvements for crawldir.
The biggest difference is that it now works with a local db that holds information about the state of the ingest, allowing for cancelling/resuming and a basic report at the end. I also moved the --no-junk logic out into a separate dot-ignore file, which gives users much more control over what to skip.
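
To illustrate the general idea (this is not openaleph's actual code), here is a minimal Python sketch of a resumable crawl that keeps per-file state in a local SQLite db and reads skip patterns from a dot-ignore file. Everything named here is an assumption made for the example: the `.crawlignore` filename, the `files` table, the `crawl` function and the `upload` callback are all hypothetical.

```python
import fnmatch
import sqlite3
from pathlib import Path


def load_ignore_patterns(ignore_file: Path) -> list[str]:
    """Read glob patterns from a dot-ignore file, skipping blanks and comments."""
    if not ignore_file.exists():
        return []
    lines = (ln.strip() for ln in ignore_file.read_text().splitlines())
    return [ln for ln in lines if ln and not ln.startswith("#")]


def is_ignored(rel_path: str, patterns: list[str]) -> bool:
    """True if the relative path, or any single component of it, matches a pattern."""
    parts = rel_path.split("/")
    return any(
        fnmatch.fnmatch(rel_path, p) or any(fnmatch.fnmatch(part, p) for part in parts)
        for p in patterns
    )


def crawl(root: Path, db_path: str, upload, ignore_name: str = ".crawlignore") -> dict:
    """Upload each non-ignored file under root once, recording state in SQLite.

    `upload` is a hypothetical callback that takes a Path and raises on failure.
    """
    patterns = load_ignore_patterns(root / ignore_name)
    db = sqlite3.connect(db_path)
    db.execute("CREATE TABLE IF NOT EXISTS files (path TEXT PRIMARY KEY, status TEXT)")
    report = {"done": 0, "skipped": 0, "ignored": 0, "failed": 0}
    for path in sorted(p for p in root.rglob("*") if p.is_file()):
        rel = path.relative_to(root).as_posix()
        if rel == ignore_name or is_ignored(rel, patterns):
            report["ignored"] += 1
            continue
        row = db.execute("SELECT status FROM files WHERE path = ?", (rel,)).fetchone()
        if row and row[0] == "done":
            report["skipped"] += 1  # already finished in an earlier run
            continue
        try:
            upload(path)
            status = "done"
        except Exception:
            status = "failed"  # not marked done, so the next run retries it
        db.execute("INSERT OR REPLACE INTO files VALUES (?, ?)", (rel, status))
        db.commit()  # commit per file so a cancelled run keeps its progress
        report[status] += 1
    db.close()
    return report
```

Running `crawl` a second time against the same db file skips anything already marked done and retries whatever failed or was never reached, and the returned dict works as the basic end-of-run report. The db file should live outside the crawled directory so it does not get picked up itself.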