6177 Commits

Author SHA1 Message Date
Mike Fährmann
bc7e95684d [piczel] fix extraction (#6735)
- fix pagination
- update API endpoints
- provide 'count' metadata field
- use BASE_PATTERN and self.groups[…]
2024-12-27 15:08:08 +01:00
Mike Fährmann
167a726972 [szurubooru] support 'visuabusters.com/booru' (#6729) 2024-12-26 19:04:16 +01:00
Mike Fährmann
998f949db1 [civitai] add 'user-videos' extractor (#6644) 2024-12-26 10:18:54 +01:00
Mike Fährmann
c6d5e25055 [workflows:executables] use Python 3.13 2024-12-25 19:50:26 +01:00
Mike Fährmann
99de0e1867 [instagram] fix 'pinned' values for '/reels' results (#6719) 2024-12-25 19:42:50 +01:00
Mike Fährmann
3024dce06b [8muses] skip albums without valid 'permalink' (#6717) 2024-12-24 13:49:19 +01:00
Mike Fährmann
09b2f8ea9e [batoto] update domains (#6714)
- support 'fto.to' and 'jto.to'
- use 'xbato.org' for deprecated domains
2024-12-24 09:38:07 +01:00
Mike Fährmann
f9d3603bfc [hitomi] fix searches (#6713) 2024-12-24 09:36:29 +01:00
Mike Fährmann
a3fb03c943 [release] ensure executables have a minimum size
to prevent issues like #6699 from happening again
2024-12-23 16:07:41 +01:00
Mike Fährmann
081856b9ce [kemonoparty] handle 'discord' favorites (#6706) 2024-12-22 18:56:21 +01:00
Mike Fährmann
de9442ba75 [directlink] use domain as 'subcategory' (#6703) 2024-12-22 17:19:56 +01:00
Mike Fährmann
18491a4ce6 [tapas] fix TypeError for locked episodes (#6700) 2024-12-21 15:17:51 +01:00
Mike Fährmann
454f766f5e release version 1.28.2 2024-12-20 19:13:42 +01:00
Mike Fährmann
6059ffccf8 [deviantart] improve 'tiptap' to HTML conversion (#6686)
- fix "KeyError: 'attrs'" for links without 'href'
- support 'strike' text markers
- support 'heading' content blocks
2024-12-20 16:45:19 +01:00
Mike Fährmann
e0514817bd [saint] support 'saint2.cr' URLs (#6692) 2024-12-19 11:43:35 +01:00
Mike Fährmann
8fbcdc1a3d [instagram] extract 'date' for stories (#6677)
generalize 'date' extraction for all post types
2024-12-18 16:33:21 +01:00
Mike Fährmann
fd5869f7df [bilibili] support '/upload/opus' URLs (#6687) 2024-12-18 08:53:27 +01:00
Mike Fährmann
5fbd0c3a63 [bilibili] extract files from 'module_top' entries (#6687) 2024-12-18 08:45:29 +01:00
Mike Fährmann
041baf8441 [common] compute and use latest Firefox UA
instead of the latest ESR UA
2024-12-17 22:20:37 +01:00
Mike Fährmann
0802e42c90 [common] use random unused port for '"user-agent": "browser"' 2024-12-17 21:40:20 +01:00
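
The usual way to get a random unused port is to bind a socket to port 0 and let the operating system pick one. The sketch below illustrates that general technique and is not taken from the commit; the function name `find_unused_port` is hypothetical.

```python
import socket


def find_unused_port(host="127.0.0.1"):
    """Ask the OS for a currently unused TCP port on 'host'."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        # Port 0 tells the OS to choose any free port.
        sock.bind((host, 0))
        # getsockname() returns (address, port) for AF_INET sockets.
        return sock.getsockname()[1]


if __name__ == "__main__":
    print(find_unused_port())
```

The port is released when the socket closes, so another process could claim it before it is reused. For a short-lived local server this race is normally acceptable.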
Mike Fährmann
9f3e4511c6 [tapas] restructure extractors (#6680)
- handle all episodes with TapasEpisodeExtractor
- prevent locked episodes from stopping processing of all following
  episodes
2024-12-17 21:36:37 +01:00
Mike Fährmann
5ab2ae17bc support wildcards for parent>child categories (#6673)
For example "reddit>*" for all reddit child extractors
2024-12-16 08:50:18 +01:00
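
As a rough illustration of how a pattern like "reddit>*" could select all child extractors of a parent, the sketch below matches "parent>child" names against shell-style wildcards. It is an assumption-laden example, not the project's implementation: `find_options` and the sample `config` dict are hypothetical, and matching is done with the standard library's `fnmatch`.

```python
from fnmatch import fnmatchcase


def find_options(config, parent, child):
    """Return the options of the first key matching 'parent>child'."""
    name = f"{parent}>{child}"
    for pattern, options in config.items():
        # '*' in a key matches any child category name.
        if fnmatchcase(name, pattern):
            return options
    return None


# Hypothetical configuration: one entry for every reddit child extractor.
config = {
    "reddit>*": {"filename": "{id}.{extension}"},
}

if __name__ == "__main__":
    print(find_options(config, "reddit", "imgur"))
    print(find_options(config, "twitter", "imgur"))
```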
Mike Fährmann
b6b1008ef2 [kemonoparty] support new favorite URLs (#6676) 2024-12-16 07:45:33 +01:00
Mike Fährmann
5e13235aca [zerochan] fix parsing regular JSON
i.e. remove debug remains ...
2024-12-14 20:41:08 +01:00
Mike Fährmann
7f6a53c347 [cohost] add 'avatar' and 'background' options (#6656) 2024-12-14 20:16:28 +01:00
Mike Fährmann
94d7df186f [bluesky] default to /posts if reposts/quoted is enabled (#6583) 2024-12-13 22:24:37 +01:00
Mike Fährmann
85a37ca039 [facebook] decode surrogate pairs in metadata values (#6599) 2024-12-12 20:20:30 +01:00
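
Surrogate pairs show up when a value contains two UTF-16 code units (for example "\ud83d\ude00") instead of one code point. One common way to merge them in Python is a round trip through UTF-16 with the `surrogatepass` error handler. This is a sketch of that general technique, not necessarily what the commit does; the function name is hypothetical.

```python
def decode_surrogate_pairs(text):
    """Merge UTF-16 surrogate pairs in 'text' into single code points."""
    # 'surrogatepass' lets the encoder emit the surrogates as-is;
    # decoding the result then combines each valid pair.
    return text.encode("utf-16", "surrogatepass").decode("utf-16")


if __name__ == "__main__":
    value = "\ud83d\ude00"  # two surrogate code units
    fixed = decode_surrogate_pairs(value)
    print(len(value), len(fixed))
```

A lone, unpaired surrogate still fails to decode and raises `UnicodeDecodeError`, so callers may want to catch that and keep the original value.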
Mike Fährmann
a33065be86 [zerochan] parse API response manually when json.loads() fails (#6632) 2024-12-12 19:57:37 +01:00
Mike Fährmann
d2c66ac34d [zerochan] fix 'source' extraction when not logged in 2024-12-12 18:16:11 +01:00
Mike Fährmann
63008f77e2 merge #6607: [lofter] add initial support
(#650, #2294, #4095, #4728, #5656)
2024-12-11 20:41:52 +01:00
Mike Fährmann
717081dabd [lofter] update
- add tests
- update docs/supportedsites
- provide 'date' metadata
- simplify/restructure some code
2024-12-11 20:39:01 +01:00
Mike Fährmann
0e942f0829 merge #6613: [itaku] add 'search' extractor 2024-12-11 11:54:33 +01:00
Mike Fährmann
b58af14bdb [itaku] update
- simplify code
- update docs/supportedsites
- update test results
2024-12-11 11:52:42 +01:00
Mike Fährmann
86334f9c4a [yiffverse] add support (#6611) 2024-12-11 10:57:21 +01:00
Mike Fährmann
473ee5ff85 [recursive] add 'https://' to URLs if not present 2024-12-10 17:16:52 +01:00
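
A minimal sketch of the idea, assuming a URL counts as "having a scheme" when it starts with something like "http://" or "ftp://". The helper name `ensure_scheme` and the handling of protocol-relative URLs ("//host/path") are assumptions made for illustration, not details from the commit.

```python
import re

# Matches a leading URL scheme such as 'https://' or 'ftp://'.
_SCHEME = re.compile(r"[a-zA-Z][a-zA-Z0-9+.-]*://")


def ensure_scheme(url, scheme="https://"):
    """Prefix 'url' with 'scheme' unless it already starts with one."""
    if _SCHEME.match(url):
        return url
    # Also covers protocol-relative URLs like '//example.org/a'.
    return scheme + url.lstrip("/")


if __name__ == "__main__":
    print(ensure_scheme("example.org/a.jpg"))
    print(ensure_scheme("http://example.org/a.jpg"))
```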
Mike Fährmann
e8826ed3d4 [common] simplify HTTP error messages
[warning] HTTPSConnectionPool(host='domain.tld', port=443): Max retries
exceeded with url: /a.jpg (Caused by NameResolutionError("<urllib3.
connection.HTTPSConnection object at 0x7247fe436ea0>: Failed to resolve
'domain.tld' ([Errno -2] Name or service not known)")) (1/5)

->

[warning] NameResolutionError: Failed to resolve 'domain.tld'
([Errno -2] Name or service not known) (1/5)
2024-12-10 17:13:44 +01:00
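
To make the before/after above concrete, here is a string-based sketch that reduces the long message to "ErrorName: reason". It only illustrates the transformation on the example text. The actual change may well inspect the exception objects instead, and both the `simplify` helper and its regular expression are hypothetical.

```python
import re

# Captures the wrapped error's class name and its message, dropping the
# "<... object at 0x...>: " prefix if one is present.
_CAUSED_BY = re.compile(
    r'''\(Caused by (\w+)\((["'])(?:<[^>]*>: )?(.*)\2\)\)$''')


def simplify(message):
    """Shorten a 'Max retries exceeded ... (Caused by ...)' message."""
    match = _CAUSED_BY.search(message)
    if not match:
        return message  # unknown format: leave untouched
    return f"{match.group(1)}: {match.group(3)}"


raw = (
    "HTTPSConnectionPool(host='domain.tld', port=443): Max retries "
    "exceeded with url: /a.jpg (Caused by NameResolutionError("
    "\"<urllib3.connection.HTTPSConnection object at 0x7247fe436ea0>: "
    "Failed to resolve 'domain.tld' "
    "([Errno -2] Name or service not known)\"))"
)

if __name__ == "__main__":
    print(simplify(raw))
```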
Mike Fährmann
86f3f3f763 [common] detect DDoS-Guard challenge pages 2024-12-08 21:39:04 +01:00
Mike Fährmann
47311352de [cyberdrop] add extractor for media URLs (#2496)
https://github.com/mikf/gallery-dl/issues/2496#issuecomment-2495467133
2024-12-08 20:57:12 +01:00
hdk5
0466fcab4c [lofter]: add initial support 2024-12-08 19:37:42 +02:00
Mike Fährmann
d7873b9eb7 release version 1.28.1 2024-12-07 18:03:48 +01:00
Mike Fährmann
939cf51b01 [danbooru] add missing ':' to 'md5' tag prefix 2024-12-07 17:39:25 +01:00
Mike Fährmann
ef7ff31117 [realbooru] fix extraction (#6543)
- extract data from HTML pages since API is no longer usable
- move code into its own separate 'realbooru' module
2024-12-07 17:39:25 +01:00
Mike Fährmann
fbb4b222ec [inkbunny] fix re-login loop (#6618) 2024-12-07 17:39:25 +01:00
Mike Fährmann
e1613fc0f4 [nhentai] select random file servers for download URLs (#6620)
i1, i2, i3, i4 instead of just i.nhentai.net
2024-12-07 17:39:25 +01:00
Mike Fährmann
7091904b20 [common] restore using environment proxies by default (#6553, #6609)
change 'proxy-env' default to 'true'
2024-12-07 17:38:44 +01:00
Shelvacu
b90c77d8f1 [itaku] add 'search' extractor 2024-12-05 21:09:38 -08:00
Mike Fährmann
34e157e166 [zerochan] download webp and gif files, add 'extensions' option (#6576) 2024-12-05 21:25:44 +01:00
Mike Fährmann
624dc7f407 [bluesky] add 'info' extractor 2024-12-05 08:36:33 +01:00
Mike Fährmann
a526a3d00d [patreon] add 'format-images' option (#6569) 2024-12-04 21:38:01 +01:00
Mike Fährmann
45ce0a2797 [instagram] handle empty 'carousel_media' entries (#6595) 2024-12-04 18:31:23 +01:00