-
Notifications
You must be signed in to change notification settings - Fork 59
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add support for >150 new comics, 4 new comic hosts, and more #140
Conversation
I tried using `scripts/tapastic.py` provided by @Techwolfy but unfortunately Tapastic has changed their format, and it no longer works...
f2a23b3
to
fb8c667
Compare
Codecov Report
@@ Coverage Diff @@
## master #140 +/- ##
=======================================
Coverage 82.61% 82.61%
=======================================
Files 72 72
Lines 6282 6282
Branches 423 423
=======================================
Hits 5190 5190
Misses 977 977
Partials 115 115
Continue to review full report at Codecov.
|
Wow, thanks a lot! Actually, I would like to take those changes in smaller batches, but I might just cherry-pick some changes, so you have an easier time to get this merged... |
I'd like to use these updates. Should I anticipate these being merged in the next week or so, or should I pull from Techwolfy's upstream? @TobiX |
Not soon. I currently don't have the time to review this as one big chunk. Breaking this into smaller parts would really help speed up the review process... |
I've started cherry-picking commits starting from the oldest, I'm currently up to "Add Evon" - It would be really useful if you could rebase the branch (and probably also remove everything you reverted later). I didn't pick the following commits:
|
Rebased, and dropped the changes I reverted (those were for a few comics that block scraping).
For ShadesOfGray, the author couldn't keep updating the comic, but fans convinced them to finish the story by posting the rest of the script and paid the hosting bills for another year. It's actually why I started working on this project; I wanted to make a copy for myself before it went offline, and decided to grab every other comic I've ever read while I was at it. ^.^ I'd originally planned on writing my own scraper, but dosage was modular enough that it was easier to extend it. Unfortunately, some comics were only partially recoverable, and a few seem to be gone forever. It looks like the remaining Katbox comics might be next, as it's shutting down soon. |
First: Thanks for rebasing :D
Ah, I see. I looked at that commit and it does something which might negatively influence one of dosage's features: Using As I think more about this, it's probably not worth sacrificing "good" directory names like (Currently with limited internet, so no cherry-picks today) |
Currently it breaks SequentialArt and Drowtales.
|
Correction: It's broken for SequentialArt and Drowtales on Python 2.7. Although Misfile and WapsiSquare are having issues even under Python 3.7:
|
afd0d33
to
4e5805a
Compare
@MaddTheSane None of those issues are related to this change; the Python 2.7 bugs were there before, and the Misfile and WapsiSquare scrapers were both broken by website redesigns. I fixed the scraper for Misfile and added one for its new sub-comic. @TobiX I hadn't noticed the issue with the |
Some more notes while picking more to master:
|
I'll try to continue picking "easy" commits in a steady pace in the coming days. Hopefully, we have only the complex cases left by the end of the year 🐺 |
My guess why Shivae is in |
Rebased! I'll be off work for the holidays soon, so hopefully I'll have a bit more time to work on this as well.
|
I cherry-picked all "simple" commits to master. Only bigger modules (or modules which I mentioned above) should be left after a rebase... |
Rebased. I cleaned up the leftover duplicates and merge artifacts, but I think there are still a few recently-dead comics in there that I need to remove.
|
|
Rebased!
|
The only big chunk left now is webtoons. Probably will come around it next week. PS: Feel free to add yourself to |
Rebased. Almost done! Let me know if you want me to change or remove any of the remaining minor fixes in the PR so you can merge it normally.
|
Arrg. Webtoons hates me and serves me some GDPR-crap instead of comic pages :/ There is |
f09f680
to
ab7b6f0
Compare
@TobiX Ping; it's been a few weeks, anything else you want me to change so you can merge this? |
Sorry, not much bandwidth for Dosage in the last weeks. And adding more onto this won't make it any easier for me to merge. Please split your contributions into smaller pull requests in the future...
Multiple commits with no apparent functional change:
Cherry-picked all the easy ones, will take a look at the rest when I'm more awake. |
Rebased. Sorry again about the massive PR! I know it's a pain to code-review, thanks for pushing through it.
All that's left here is one last ComicFury entry and an extra padding digit for the WebToons image filenames (some comics use >100 images per strip). |
Thanks for merging this! |
This PR adds support for more than 150 new comics, plus all comics on 4 new comic-hosting sites. It also fixes various issues with more than 50 existing comics.
Sorry about the size of the pull request; let me know if there are any changes you'd like me to make.
New hosts:
New comics:
Fixed comics:
Additional changes: