Commit Graph

3055 Commits

Author SHA1 Message Date
HRXN
e13cae182b [nozomi] Extend default archive-fmt for Tag and Search Extractor (#1529)
Closes #1523
2021-05-04 19:26:35 +02:00
Mike Fährmann
bc868e7bb8 consider apparently long extensions as part of the filename
(#1516)
2021-05-02 21:15:50 +02:00
Mike Fährmann
2133f1d77f [readcomiconline] change domain to 'readcomiconline.li'
(closes #1517)
2021-05-01 16:41:16 +02:00
Mike Fährmann
66f28e471c [kemonoparty] update file URLs directly linking to kemono.party
(#1514)
2021-05-01 02:30:10 +02:00
Mike Fährmann
6fa20d456b [sankaku] update invalid-token detection (fixes #1515) 2021-04-30 22:04:45 +02:00
Mike Fährmann
4b65ebf652 [kemonoparty] fix file URLs (#1514)
files are now hosted on https://data.kemono.party/
2021-04-29 19:36:34 +02:00
Mike Fährmann
fa519f9202 [pixiv] change 'translated-tags' option (#1507)
- rename to 'tags'
- use string-values: "japanese", "translated", "noop"
- remove duplicate entries for "translated" tags
2021-04-29 19:30:43 +02:00
Mike Fährmann
5b4da4b4bf reorder config access in Job constructor
(#1111)
2021-04-27 15:12:59 +02:00
Mike Fährmann
221015e586 [downloader:http] disable filename extension changes for ugoira
(#1507)
2021-04-27 01:29:09 +02:00
Mike Fährmann
e5123f56c9 fix crash when using --no-download with --ugoira-conv (#1507) 2021-04-26 23:35:44 +02:00
Mike Fährmann
07b6661a87 release version 1.17.3 2021-04-25 21:23:26 +02:00
Mike Fährmann
c6c4a73f87 update fanbox entry in supportedsites.md 2021-04-25 19:44:19 +02:00
thatfuckingbird
e47952ac14 add extractors for fantia and fanbox (#1459)
* add extractors for fantia and fanbox

* appease linter

* make docstrings unique

* [fantia] refactor post extraction

* [fantia] capitalize

* [fantia] improve regex pattern

* code style

* capitalize

* [fanbox] use BASE_PATTERN for url regexes

* [fanbox] refactor metadata and post extraction

* [fanbox] improve url base pattern

* [fanbox] accept creator page links ending with /posts

* [fanbox] more tests

* [fantia] improved pagination

* [fanbox] misc. code logic improvements

* [fantia] finish restructuring pagination code

* [fanbox] avoid making a request for each individual post when processing a creator page

* [fanbox] support embedded videos

* [fanbox] fix errors

* [fanbox] document extractor.fanbox.videos

* [fanbox] handle "article" and "entry" post types, all embeds

* [fanbox] fix downloading of embedded fanbox posts
2021-04-25 19:39:13 +02:00
Mike Fährmann
d900edfcfb [simplyhentai] fix extraction 2021-04-25 18:51:43 +02:00
Mike Fährmann
ba8180b5e6 [bcy] don't crash with deleted posts 2021-04-25 18:51:09 +02:00
Mike Fährmann
d108421461 [myportfolio] fix extraction 2021-04-24 01:22:57 +02:00
Mike Fährmann
8b22d4e667 [mangapark] use '"browser": "firefox"' by default
to get rid of Cloudflare CAPTCHA resonses
2021-04-23 23:21:02 +02:00
Mike Fährmann
77a9cc6fd6 update supportedsites.md entry for Instagram 2021-04-23 23:21:01 +02:00
Mike Fährmann
9514cb8c12 [exhentai] update 'limits' check (#1487)
Only use 'limits' to set a custom upper bound.
Checking if the actual maximum gets exceeded is not necessary.
2021-04-23 23:20:45 +02:00
thatfuckingbird
141ca4ac0a [pixiv] also save untranslated tags when translated-tags is enabled (#1501) 2021-04-23 23:02:41 +02:00
Renan Vedovato Traba
9322c5e43b [exhentai] restore limit config (#1487)
This partially reverts commit e9ec91c8
2021-04-22 21:21:41 +02:00
Mike Fährmann
cb86bb9cc9 [hentaicosplays] add 'slug' metadata field (closes #1483) 2021-04-19 16:28:01 +02:00
Mike Fährmann
b4ed7cb961 fix 'category-transfer' (#1111)
broken since commit 055c32e0
2021-04-19 00:55:44 +02:00
Mike Fährmann
dddda7d0e7 [hentaicosplays] use GalleryExtractor (#1473) 2021-04-18 20:30:39 +02:00
Mike Fährmann
d88e34f17e [webtoons] use GalleryExtractor 2021-04-18 20:28:31 +02:00
Mike Fährmann
c4210b5371 [webtoons] update agegate/GDPR cookies 2021-04-18 20:28:31 +02:00
Mike Fährmann
d89eb7536b [naverwebtoon] use GalleryExtractor 2021-04-18 20:28:31 +02:00
Mike Fährmann
9b52eb9bf1 [naverwebtoon] ignore non-comic images 2021-04-18 20:28:30 +02:00
Mike Fährmann
bdfcc9c4b1 update extractor test results 2021-04-18 20:28:15 +02:00
Hans Christian Gunawan
334d690687 [hentaicosplays] Add extractor (#1473) 2021-04-18 20:28:00 +02:00
Mike Fährmann
82c32d25af [500px] update query hashes 2021-04-15 17:28:31 +02:00
Mike Fährmann
de14b7ad7a [slideshare] fix extraction 2021-04-15 17:15:59 +02:00
Mike Fährmann
bef3105121 [komikcast] fix extraction 2021-04-15 17:04:53 +02:00
Mike Fährmann
086925e685 [shopify] support omgmiamiswimwear.com (closes #1280) 2021-04-13 23:54:03 +02:00
thatfuckingbird
224b883ff4 [danbooru] add option for extended metadata extraction (#1458)
* [danbooru] add option for extended metadata extraction

* appease linter

* [danbooru] update docs/configuration.rst

* [danbooru] rename extended-metadata -> metadata
2021-04-13 23:41:30 +02:00
thatfuckingbird
dff03a6605 [booru] add an option to extract notes (only gelbooru for now) (#1457)
* [booru] add an option to extract notes (currently implemented only for gelbooru)

* appease linter

* [gelbooru] rename "text" to "body" in note extraction

* add a code comment about reusing return value of _extended_tags
2021-04-13 23:40:24 +02:00
Mike Fährmann
78d7ee3ef4 [yuki] remove module for yuki.la 2021-04-12 21:42:32 +02:00
Mike Fährmann
a86ffb04bb add 'output.fallback' option
to enable/disable fallback URLs for -g/--get-urls
2021-04-12 02:00:41 +02:00
Mike Fährmann
5a98bcec3a [deviantart] improve folder name matching (fixes #1451) 2021-04-11 20:39:40 +02:00
thatfuckingbird
918b0441fb [gelbooru] fix tag category extraction (#1455) 2021-04-10 19:05:00 +02:00
Mike Fährmann
fe6ce5b8f8 [erome] skip deleted albums (fixes #1447) 2021-04-09 15:24:18 +02:00
Mike Fährmann
457abf0e71 [deviantart] fix pagination for Eclipse results (fixes #1444)
- don't crash on missing keys
- use fallback for invalid 'nextOffset' values
2021-04-09 15:16:56 +02:00
Mike Fährmann
dee540050f [8muses] fix JSON unobfuscation
limit the characters that get modified,
leave non-ASCII characters alone
2021-04-09 01:49:54 +02:00
Mike Fährmann
b869b3a9eb [instagram] fetch media for incomplete GraphSidecar posts
GraphSidecar results from /tagged pages don't contain
all media elements, only the first one.

(#1439)
2021-04-09 00:37:16 +02:00
Mike Fährmann
b0686d2174 [instagram] update query hashes 2021-04-09 00:37:15 +02:00
Mike Fährmann
e8e3717b71 [instagram] add extractor for /tagged posts (#1439) 2021-04-09 00:37:08 +02:00
Mike Fährmann
abafe71e04 [exhentai] fix image limit detection (closes #1437)
check for image limit message when downloading original files
2021-04-08 21:33:41 +02:00
Mike Fährmann
a75e485461 add archive format to InfoJob output (#875) 2021-04-07 21:50:16 +02:00
Mike Fährmann
52a7913abe [artstation] download /4k/ images (#1422) 2021-04-07 21:50:16 +02:00
Mike Fährmann
37940193a6 build executables with SOCKS proxy support (closes #1424) 2021-04-07 21:50:03 +02:00