Commit Graph

3917 Commits

Author SHA1 Message Date
Mike Fährmann
4b2a006871 [skeb] add 'search' extractor (#2945) 2022-09-21 17:57:55 +02:00
Mike Fährmann
94b34f460e [exhentai] add slash to the end of gallery URLs (#2947) 2022-09-21 17:54:20 +02:00
Mike Fährmann
2787c8511a [mastodon] warn about moved accounts (#2939) 2022-09-20 17:57:14 +02:00
Mike Fährmann
d699310fdf [blogger] add 'label' or 'query' metadata fields (#2930)
for '/search/label/…' or '/search?q=…' URLs
2022-09-20 11:37:39 +02:00
Mike Fährmann
eef50c1f28 [blogger] split 'search' extractor (#2930) 2022-09-19 21:01:21 +02:00
Mike Fährmann
64202dd012 release version 1.23.1 2022-09-18 14:01:09 +02:00
Mike Fährmann
d29fb94098 [bunkr] use 'media-files' servers for m4v and mov files (#2925) 2022-09-18 13:39:29 +02:00
enduser420
bd846abba0 [hotleak] add hotleak extractor (#2909) (#2890) 2022-09-18 13:37:16 +02:00
Mike Fährmann
e99a9b2aff [twitter] improve 'cards-blacklist' (#2875)
allow blacklisting domains and 'name:domain',
where 'domain' depends on a card's 'vanity_url' value
2022-09-17 17:46:34 +02:00
Mike Fährmann
aaf6992bae [twitter] fix new-style '/card_img/' URLs 2022-09-17 17:45:09 +02:00
Mike Fährmann
40baa77630 [twitter] provide proper 'date' for syndication results (#2920) 2022-09-17 14:11:43 +02:00
Mike Fährmann
46fe469c53 [tumblr] implement 'ratelimit' option (#2919) 2022-09-17 14:10:33 +02:00
Mike Fährmann
d0b73fec14 [flickr] add support for secure.flickr.com (#2910) 2022-09-14 16:19:27 +02:00
Mike Fährmann
35eddaa94e [reddit] prevent exception with empty submission URLs (#2913) 2022-09-14 16:14:42 +02:00
Mike Fährmann
464ea90d14 [exhentai] guess extension for original files (#2842)
makes it possible to sometimes, when guessed correctly ('.jpg'),
skip an original file download without costing image limit points
2022-09-14 16:06:27 +02:00
Mike Fährmann
551fdf7ad7 [exhentai] move 509 check into its own function 2022-09-13 18:27:14 +02:00
Mike Fährmann
7a799df17f [tumblr] pre-compile regular expressions 2022-09-13 17:50:48 +02:00
Mike Fährmann
73a52a95b0 update Cloudflare IUAM detection 2022-09-12 11:40:06 +02:00
Mike Fährmann
673b6f1218 [poipiku] use 'img-org.poipiku.com' as image domain (#2796) 2022-09-12 11:21:01 +02:00
Mike Fährmann
4ca1a6e5f3 [bunkr] fix extraction (#2903) 2022-09-09 18:09:52 +02:00
Mike Fährmann
8b76149521 [exhentai] improve 509.gif detection (#2901) 2022-09-09 18:09:52 +02:00
Mike Fährmann
bdad9c40dd remove whitespace before comments in input file URLs (#2808) 2022-09-09 18:09:21 +02:00
Mike Fährmann
b36125333f [postprocessor:zip] implement 'files' option (#2872) 2022-09-09 11:41:27 +02:00
Mike Fährmann
2ed58029f9 {paheal[ add proper support for videos (#2892) 2022-09-04 13:30:48 +02:00
Mike Fährmann
444dfb4aa6 [instagram] add 'highlight_title' and 'date' metadata
to highlight posts (#2879)
2022-09-03 16:21:26 +02:00
Mike Fährmann
7f764ebee6 [redgifs] "fix" download URLs (#2884) 2022-09-02 23:25:38 +02:00
Mike Fährmann
3cb8327c60 [zerochan] add 'metadata' option (#2861) 2022-09-02 23:25:19 +02:00
blankie
9745b48830 [tumblr] attempt to fetch high-quality inline images (#2877)
* [tumblr] attempt to fetch high-quality images (again)

Fixes #1846, and fixes #1344

* slight refactor

* update configuration.rst entry
2022-08-31 10:53:50 +02:00
Mike Fährmann
daef91c925 [smugmug] update default API credentials (#2881)
The old key lacked v2 access and I'm unable to accept
the new terms of service since my old account got deleted
2022-08-31 10:28:25 +02:00
Mike Fährmann
4d78ca89db [twitter] add 'cards-blacklist' option (#2875) 2022-08-31 10:28:25 +02:00
Mike Fährmann
4d7cb0bf56 [twitter] general support for unified cards (#2875)
just removing the 'type' check seems to work
2022-08-31 10:25:27 +02:00
Mike Fährmann
8839b0d2ee add section about global replacement fields to formatting.md
(#2862)
2022-08-30 21:32:22 +02:00
Mike Fährmann
1da415c160 update chapter filter section in README (#2864)
- use -o lang0fr for mangadex
- update URL
2022-08-30 21:31:55 +02:00
Mike Fährmann
f16fbe9f93 document 'extractor.twitter.expand' (#2848) 2022-08-30 18:16:20 +02:00
Mike Fährmann
7ddfff957c [twitter] support "image_website" unified cards (#2875) 2022-08-30 18:16:10 +02:00
Mike Fährmann
51f14223a8 release version 1.23.0 2022-08-28 19:54:55 +02:00
Mike Fährmann
2eb0ddd083 [hitomi] fix error when number of tag results is multiple of 25
(#2870)
2022-08-28 17:06:11 +02:00
Mike Fährmann
3cebf787c4 [slideshare] fix metadata extraction 2022-08-28 10:52:28 +02:00
Mike Fährmann
da11fb32d0 update extractor test results 2022-08-28 00:16:12 +02:00
Mike Fährmann
636d03df95 [nijie] reduce cache maxage to 90 days 2022-08-27 21:57:45 +02:00
Mike Fährmann
f375ec0ffa [vsco] fix 'collection' extraction 2022-08-27 21:16:22 +02:00
Mike Fährmann
8672f8a2b9 [skeb] fix archive_ids for thumbnails and article images
8cf5981ded (commitcomment-82316040)
2022-08-27 16:46:53 +02:00
Mike Fährmann
69995d789b Revert "[twitter] use '{author[name]' in default directory names"
This reverts commit 9ad3cdc5d8.
2022-08-27 15:11:59 +02:00
Mike Fährmann
946643c23c [hitomi] use maxage for gg.js cache (#2863)
cached values become invalid after 1-2 hours
2022-08-26 17:57:17 +02:00
Mike Fährmann
d508b2c049 [gelbooru] implement 'pool' pagination (#2853) 2022-08-26 17:57:17 +02:00
Mike Fährmann
67a2efb885 [rule34] implement 'pool' pagination (#2853) 2022-08-26 17:57:17 +02:00
Mike Fährmann
70dc4ce911 [skeb] ignore article images with empty URL
8cf5981ded (commitcomment-81980633)
2022-08-26 17:57:17 +02:00
Mike Fährmann
f362d4a3c7 [e621] fix 'popular' extraction 2022-08-26 17:57:17 +02:00
Mike Fährmann
7e385ed63e [foolfuuka] update domains
- remove nyafuu
- add rozenarcana (https://archive.alice.al/)
- add tokyochronos (https://www.tokyochronos.net)
2022-08-26 17:57:17 +02:00
Mike Fährmann
6ba72b6bc6 [twitter] ignore invalid user entries (#2850) 2022-08-26 17:57:17 +02:00