Commit Graph

5053 Commits

Author SHA1 Message Date
Mike Fährmann
e7922ababd [naver] change categories (#7746)
- 'naver'        -> 'naver-blog'
- 'chzzk'        -> 'naver-chzzk'
- 'naverwebtoon' -> 'naver-webtoon'
2025-07-02 23:20:40 +02:00
Mike Fährmann
a4fb9e6d33 [furaffinity] fix 'submissions' results for new layout (#7759) 2025-06-30 19:05:19 +02:00
Mike Fährmann
8764f32ea7 [hitomi] fix negative tag searches (#7694) 2025-06-30 18:40:05 +02:00
Mike Fährmann
95338ff0ec replace 'result' with 'results' for lists
more consistent names
2025-06-30 12:10:57 +02:00
Mike Fährmann
3810555bbd do not use 'append = list.append' 2025-06-30 11:42:44 +02:00
Mike Fährmann
755b2a7eb2 [sankaku] fix extracting extended tag categories (#7744)
by sending a proper Referer header
and not one from https://sankaku.app/
2025-06-29 22:15:20 +02:00
Mike Fährmann
ef12882ff7 [behance] fix '403 Forbidden' error (#7710)
update internal cookies
2025-06-29 21:36:12 +02:00
Mike Fährmann
b102c474c1 [bunkr] fix file downloads (#7747)
update 'root_dl', 'root_api', and default 'endpoint'
2025-06-29 17:53:13 +02:00
Mike Fährmann
22b40fc787 [sankaku] remove 'id-format' option (#5073 #6808) 2025-06-29 17:50:19 +02:00
Mike Fährmann
f2a72d8d1e replace 'request(…).json()' with 'request_json(…)' 2025-06-29 17:50:19 +02:00
Mike Fährmann
9dbe33b6de replace old %-formatted and .format(…) strings with f-strings (#7671)
mostly using flynt
https://github.com/ikamensh/flynt
2025-06-29 17:50:19 +02:00
enduser420
f77e98b57d [chzzk] add 'comment' and 'community' extractors (#7735 #7741)
* [chzzk] add 'comment' and 'community' extractors
* [chzzk] update
* [chzzk] add tests
* [chzzk] update docs/supportedsites
* [chzzk] add 'offset' option
* [docs] add 'offset' option to gallery-dl.conf
2025-06-28 15:27:19 +02:00
Mike Fährmann
c8e4a2f8d1 [comick] use 'chapter_hid' as default 'archive-format' (#6782)
'chapter_hid' seems to be all that is needed to uniquely identify a
chapter, as comick.io is able to redirect to a full chapter URL from
one that only includes this value:

https://comick.io/comic/_/L7TaJB4n
  ->
https://comick.io/comic/neko-no-oshigoto/L7TaJB4n-chapter-10.5-en
2025-06-28 09:02:33 +02:00
Mike Fährmann
d969dbbab1 [common] fix bug in df6f4e53 when first group is empty 2025-06-28 09:02:33 +02:00
Mike Fährmann
5e61fe8668 [rule34xyz] implement login with username & password (#7736) 2025-06-27 22:35:59 +02:00
Mike Fährmann
fde6110d96 [kemono] extend 'duplicates' option (#7696)
allow selecting which file type(s) ('file'/'attachment'/'inline')
not to consider when discarding duplicates
2025-06-27 21:32:34 +02:00
Mike Fährmann
df6f4e5307 [common] only auto-set page_url when first group starts with / 2025-06-26 23:35:53 +02:00
Mike Fährmann
26e81e4162 [common] rename 'gallery_url'/'manga_url' to 'page_url 2025-06-26 22:06:57 +02:00
Mike Fährmann
8a93616a2d [webtoons] add 'banners' option (#6468) 2025-06-26 19:29:52 +02:00
Mike Fährmann
3c6a5657ea [webtoons] update code 2025-06-26 15:24:37 +02:00
Mike Fährmann
ebaf789ac7 [tenor] extract more metadata
- format
- id_format
- size
- duration
2025-06-26 10:42:48 +02:00
Mike Fährmann
718ed6a1e7 [tenor] update code to 3.8 2025-06-26 08:38:37 +02:00
Mike Fährmann
3084ba1243 [comick] improve 'manga' extraction (#6782)
- extract all chapters
- support query parameters
2025-06-25 14:33:19 +02:00
Mike Fährmann
a50901c460 [comick] give 'w', 'h', and 's' more descriptive names (#6782) 2025-06-25 14:09:11 +02:00
Mike Fährmann
41ae7695c4 [patreon] update code to 3.8 2025-06-25 07:54:55 +02:00
Mike Fährmann
bd410fab03 [patreon] extract more metadata for paywalled posts (#6582)
https://github.com/mikf/gallery-dl/issues/6582#issuecomment-3002275572
2025-06-25 07:18:59 +02:00
Mike Fährmann
578aea51ed [comick] add initial support (#1825 #6782) 2025-06-24 18:59:50 +02:00
Mike Fährmann
e7107f160d [fanbox] add 'fee-max' option (#7726) 2025-06-24 14:47:09 +02:00
Mike Fährmann
308373e3cb [fanbox] restructure some post extraction code
- move post info API request out of _get_post_data()
- use request_json()
2025-06-24 14:13:34 +02:00
Mike Fährmann
b43ce6d897 [rule34us] prioritize 'video.rule34.us' for video downloads (#6582)
https://github.com/mikf/gallery-dl/issues/6582#issuecomment-2999111612
2025-06-24 09:36:51 +02:00
Mike Fährmann
52ca441dc2 [patreon] yield metadata for paywalled posts as well (#6582)
https://github.com/mikf/gallery-dl/issues/6582#issuecomment-2998535431
2025-06-24 09:09:52 +02:00
Mike Fährmann
92ac35c932 [imgbox] fix 'TypeError' when calling 'get_image_metadata()'
fixes regression introduced in 811b665e33
2025-06-23 18:16:46 +02:00
Mike Fährmann
97cac544ed [facebook] fix 'TypeError' when calling 'decode_all()'
fixes regression introduced in 811b665e33
2025-06-23 17:55:32 +02:00
Luca Russo
1055773301 [discord] support forwarded messages + handle missing threads (#7706 #7722)
* support forwarded messages + handle missing threads
* flake
2025-06-23 17:14:31 +02:00
Mike Fährmann
fc1ed03484 [dankefuerslesen] fix several issues (#7669)
- return chapters in first-to-last order
- provide 'group' as list of names
- remove hardcoded 'en' and 'English' language results
- replace 'popitem()' with non-destructive 'next(iter(…))'
- 'chapter-minor' -> 'chapter_minor'
2025-06-22 19:52:40 +02:00
Mike Fährmann
9c9eb7a3bb [tiktok] detect login page redirects (#7716) 2025-06-22 13:14:59 +02:00
Mike Fährmann
68960e29a1 [dankefuerslesen] add support (#7669) 2025-06-22 12:13:12 +02:00
Mike Fährmann
b0d7de3603 support using system certificates via 'truststore' (#6582)
https://github.com/mikf/gallery-dl/issues/6582#issuecomment-2989290495
2025-06-20 19:55:01 +02:00
Mike Fährmann
74c9356442 [rule34] fix file downloads (#7697)
replace 'api-cdn' subdomain of image files with 'wimg'
2025-06-20 15:07:10 +02:00
Mike Fährmann
dba45923c8 [dl:http] fail downloads with HTML content (#7697)
add 'validate-html' option
2025-06-20 14:34:40 +02:00
Mike Fährmann
e8d28d3d0d merge #7699: [warosu] handle missing images by checking hostname (#7698) 2025-06-20 13:17:44 +02:00
NecRaul
5ba7c98bc2 [warosu] Simpler/less costly hostname check 2025-06-20 14:27:21 +04:00
NecRaul
3c85032b9b [warosu] Handle missing images in the extractor by checking hostname 2025-06-20 09:34:49 +04:00
Mike Fährmann
4279928d0b [gelbooru_v02] extract 'total' / 'search_count' metadata (#7689) 2025-06-19 19:15:27 +02:00
Mike Fährmann
fcd1b8a155 [common] add a 'kwdict' member to extractor instances
to allow setting general metadata at any point and without having to
rely on a manually implemented 'metadata()' method
2025-06-19 19:08:35 +02:00
Mike Fährmann
a80d55d974 [common] improve 'user-agent' override logic
Prevent a general 'user-agent' setting, i.e. extractor.user-agent, from
overriding the User-Agent header set by an extractor's 'browser'
option, default or otherwise, meaning only extractor-level or top-level
'user-agent' settings will override the 'browser' User-Agent.

https://github.com/mikf/gallery-dl/issues/7382#issuecomment-2985296321
2025-06-19 18:58:39 +02:00
Mike Fährmann
4f7a0e5d02 [arcalive] fix download URLs (#7678)
https://github.com/danbooru/danbooru/pull/6041
2025-06-18 21:12:18 +02:00
Mike Fährmann
b0580aba86 update 'match.lastindex' usage 2025-06-18 20:24:13 +02:00
Mike Fährmann
41191bb60a 'match.group(N)' -> 'match[N]' (#7671)
2.5x faster
2025-06-18 13:05:58 +02:00
pocketinternet
3ea244eebb [webtoons] add 'thumbnails' option (#6468 #7441)
* Update webtoons.py
    Added thumbnail download capability which defaults to false
* Update configuration.rst
    Added documentation for webtoon thumbnail option
* extract thumbnails in GalleryExtractor.assets()
* simplify & fix flake8
* include 'type' in default filenames
* add test
* update docs

---------

Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-06-17 19:34:58 +02:00