Have had lost to rant about, but on time to do so
The last few weeks have been a lot more productive and things are now falling into place. One area of debate is just what material is 'Public Domain', and I've been checking into the various sources. I make no apology for using torrent to tidy up my own off-ait recording library. Pruning the adverts takes time, and so sharing the load with others just makes sense. Is this 'fair use', or is the problem stripping the adverts added by the broadcasters to cover their costs. Just where does the law stand in relation to this. The answer is obviously to document the acceptable public domain sources and this rant is a start.
I asked Mistral to give me a list of such sources which it did but it seems a little American in it's bias and I need European or UK sources.
- Internet Archive - Strange that this should be listed as it is mainly an archive of other peoples copyright material? But there are now areas that archive public domain material in addition to the Wayback Machine archive.
- Public Domain Review - Very definitely an American source although I do need to dig a little deeper into it.
- Project Gutenberg - a library of over 75,000 free eBooks but again the world's great literature, with focus on older works for which U.S. copyright has expired. (50 years of eBooks: 1971-2021)
- Google Books - It was in the list, but I question the ethics of the way it was originally created.
- Open Library - Open Library is an online project intended to create "one web page for every book ever published". But it's actually just part of the Internet Archive so perhaps no need to list separately.
When prompted for les American biased material a further list appeared which I will expand on at some point but the titles are
- HathiTrust Digital Library
- Europeana
- Library of Congress - Well I suppose it had to add this but it is just another American source
- Open Culture
- ManyBooks
One of the annoyances with Mistral is that it does not actually provide links with these sorts of list. So one has to do a manual search. That it's training set 'may be out of date' is a reasonable response, but long term established sites like these should not be a problem. I will move all this to a wiki page at some point, but for now it's just an aid memoir.