Commit Graph

4068 Commits

Author SHA1 Message Date
Mike Fährmann
2dec4bef60 merge #5591: [poipoku] avoid language-specific extraction 2024-05-17 22:47:51 +02:00
Mike Fährmann
7f1ed909d5 [imgur] match gallery/album/image URLs with title slugs (#5593) 2024-05-17 22:44:37 +02:00
fireattack
831f922c1c [poipoku] avoid language-specific extr 2024-05-14 02:42:02 +08:00
Mike Fährmann
33b07c4603 [twitter] wait for rate limit reset before 429 error (#5532) 2024-05-12 01:13:30 +02:00
Mike Fährmann
3b6f306a8b [exhentai] fix blank page detection 2024-05-11 22:01:34 +02:00
Mike Fährmann
f178839fc2 [exhentai] fix multi-page viewer detection (#4969) 2024-05-11 18:32:52 +02:00
Mike Fährmann
5b6b5dd81b [8chan] fix downloaded files by sending 'TOS' cookie (#5578) 2024-05-11 15:34:21 +02:00
Mike Fährmann
28039229fe [common] use 'create_urllib3_context' for creating SSLContexts
enables dumping TLS session keys by setting SSLKEYLOGFILE (#5215)
as well as other potentially useful settings.
2024-05-10 22:59:29 +02:00
Mike Fährmann
33006fe126 [common] disable 'check_hostname' for non-urllib3 SSLContexts
e.g. when 'browser' is set to a non-empty value and gallery-dl creates
its own SSLContext instance instead of using requests' and urllib3's
defaults.

urllib3 disables this option for its default contexts,
since it does this check on its own.

Fixes "ValueError: Cannot set verify_mode to CERT_NONE when
check_hostname is enabled" when using --no-check-certificate.

(#3614, #4891, #5576)
2024-05-10 18:20:08 +02:00
Mike Fährmann
2ee9ffeed6 merge #5568: [furaffinity] match 'xfuraffinity' URLs 2024-05-09 19:20:12 +02:00
Mike Fährmann
b27ffd790a [poipiku] fix downloading R-18 posts (#5567)
… by automatically sending a `POIPIKU_CONTENTS_VIEW_MODE=1` cookie
to enable "adult" mode.
2024-05-09 15:14:08 +02:00
Delphox
11109d5bad [furaffinity] match xfuraffinity.com 2024-05-08 12:15:47 -03:00
Mike Fährmann
bd8e4797e5 [vsco] add 'avatar' extractor (#5341) 2024-05-02 18:12:19 +02:00
Mike Fährmann
619bf5c644 [inkbunny] retry API calls with a loop instead of recursion 2024-05-01 22:46:24 +02:00
Mike Fährmann
d11ec00908 [common] fix _cfgpath for BaseExtractor objects
After the changes in 0c17884673,
_cfgpath was mssing its 'category' value
since that hadn't been initialized yet.
2024-05-01 16:00:07 +02:00
Mike Fährmann
6db1837a6d [bluesky] filter reposts only for user timelines (#5528) 2024-05-01 16:00:07 +02:00
Mike Fährmann
3cf5366143 [mastodon] add support for card images 2024-05-01 16:00:07 +02:00
Mike Fährmann
9b1995dda3 [mastodon] add 'favorite', 'list', and 'hashtag' extractors (#5529) 2024-05-01 15:59:34 +02:00
Mike Fährmann
7fd31aaf8e merge #5516: [mastodon] send canonical 'true'/'false' boolean values
instead of '1'/'0' to be compatible with Mitra instances or others that
do not fully implement Mastodon's boolean value semantics.
2024-04-27 01:34:56 +02:00
cenodis
3ba5fd9efd [mastodon] Use boolean instead of integer keys for accounts/statuses endpoint 2024-04-26 22:51:56 +02:00
Mike Fährmann
5aefa9465c [furaffinity] simplify pattern, match original domain first 2024-04-25 21:58:34 +02:00
Delphox
6969963125 [furaffinity] support fxfuraffinity and fxraffinity urls 2024-04-25 13:25:39 -03:00
Delphox
85550a37c4 [twitter] support fixvx urls 2024-04-25 13:19:40 -03:00
Mike Fährmann
ddffeeaa7b [exhentai] detect CAPTCHAs during login (#5492) 2024-04-20 00:19:22 +02:00
Mike Fährmann
347af7f5c8 [twitter] handle missing 'expanded_url' fields (#5463, #5490) 2024-04-19 21:42:22 +02:00
Mike Fährmann
c9d3b5e5d9 [pixiv] change 'sanity_level' debug message to a warning (#5180) 2024-04-19 16:41:31 +02:00
Mike Fährmann
bffadf35b7 [hotleak] download files with 404 status code (#5395) 2024-04-19 16:08:31 +02:00
Mike Fährmann
a7d8cbab0e [common] show full URL in Extractor.request() error messages 2024-04-18 15:45:36 +02:00
Mike Fährmann
a5071c9ca0 [common] fix NameError in Extractor.request()
… when accessing 'code' after an requests exception was raised.

Caused by the changes in 566472f080
2024-04-18 15:42:53 +02:00
Mike Fährmann
566472f080 [common] add 'sleep-429' option (#5160) 2024-04-16 18:41:28 +02:00
Mike Fährmann
923c6f3214 [common] simplify 'status_code' check in Extractor.request() 2024-04-16 18:39:47 +02:00
Mike Fährmann
68f4208251 [common] update Extractor.wait() message format 2024-04-16 17:51:14 +02:00
Mike Fährmann
9e5d65fbf3 [twitter] improve username & password login procedure (#5445)
- handle more subtasks
- support 2FA
- support email verification codes
2024-04-16 01:25:04 +02:00
Mike Fährmann
b38a917355 [common] add Extractor.input() method 2024-04-16 00:02:48 +02:00
Mike Fährmann
257e9fb435 [gelbooru] improve pagination logic for meta tags (#5478)
similar to 494acabd38
2024-04-15 23:14:48 +02:00
Mike Fährmann
0d72789aa3 merge #5461: [cookies] use tempfile when saving cookies.txt files 2024-04-13 19:02:39 +02:00
Mike Fährmann
63ac06643f compute tempfile path only once 2024-04-13 18:59:18 +02:00
Mike Fährmann
141a93c8fd [docs] update docs/configuration links (#5059, #5369, #5423) 2024-04-13 02:18:44 +02:00
Mike Fährmann
85bbb59483 [twitter] implement 'relogin' option (#5445) 2024-04-12 23:01:22 +02:00
Mike Fährmann
e02d2ff45d [tapas] add 'creator' extractor (#5306) 2024-04-11 23:41:50 +02:00
Mike Fährmann
35d4a706ae [pixiv:novel] add 'covers' option (#5373) 2024-04-11 22:27:49 +02:00
Mike Fährmann
b57051719f [wikimedia] support wiki.gg wikis 2024-04-09 19:24:01 +02:00
Mike Fährmann
40c1a8e471 [wikimedia] fix exception for files with empty 'metadata' 2024-04-09 19:12:15 +02:00
Mike Fährmann
647a87d17c [twitter] match '/photo/' Tweet URLs (#5443)
fixes regression introduced in 40c05535
2024-04-06 17:56:21 +02:00
Mike Fährmann
86a97d8e27 [kemonoparty] fix crash on posts with missing datetime info (#5422) 2024-04-05 00:25:23 +02:00
Mike Fährmann
ef8f02c3d9 [oauth] use Extractor.request() for HTTP requests (#5433)
Enables using proxies and general network options.
2024-04-04 20:51:54 +02:00
Mike Fährmann
095e5ded6f [reddit] support comment embeds (#5366) 2024-04-01 23:35:42 +02:00
Mike Fährmann
64948f2c09 [foolfuuka] improve 'board' pattern & support pages (#5408) 2024-04-01 22:31:25 +02:00
Mike Fährmann
0c17884673 store 'match' and 'groups' in Extractor objects 2024-04-01 03:07:52 +02:00
Mike Fährmann
bdbc8be5f3 [twitter] prevent crash when extracting 'birdwatch' metadata (#5403) 2024-03-30 18:13:38 +01:00