Commit Graph

6742 Commits

Author SHA1 Message Date
missionfloyd
13cb031fe2 [girlsreleased] add support (#6200)
* [girlsreleased] add support
* Lint
* Change "galleries" to "sets"
    As it is on the site
* Add pagination
    Add tests
* Fix tests
* Remove leftover print()
* Don't remove first set
* Yield pages
* Add filename metadata
* [girlsreleased] Refactor
* Return models as array
* Add filename numbering
    Add date metadata
* Add URL metadata
* Spawn set extractor the right way
* Adjust model/site regex
* update
    - restructure some code
    - remove constructors
    - use f-strings
* expand tests
* update docs/supportedsites
2025-06-16 19:18:19 +02:00
SpiffyChatterbox
48ac41605d [redbust] add support (#6759 #6918 #7043)
* init - Redbust.com Support
* Added Test
    Could use a second set of eyes on this
* update 'gallery' extractor
    - extract more metadata
    - simplify image extraction
    - support legacy galleries
* add tests
* update 'image' extractor
* add 'tag' extractor
* add 'archive' extractor
* restrict 'image' extractor pattern
* update docs/supportedsites
* replace quotes inside f-string

---------

Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-06-16 12:10:42 +02:00
Mike Fährmann
6227b64de2 [workflows:tests] include Python 3.7 2025-06-16 12:03:55 +02:00
Mike Fährmann
d9432ee297 [warosu] restore correct 'now' values 2025-06-16 12:00:59 +02:00
Mike Fährmann
489eccb659 merge #7677: [warosu] HTML attribute fix (#7676) 2025-06-16 11:51:08 +02:00
NecRaul
3bc6bc7c77 [warosu] Single quotes when string has a quotation mark 2025-06-16 12:50:20 +04:00
NecRaul
f56e810f42 [warosu] Attribute fix 2025-06-16 12:33:16 +04:00
Mike Fährmann
0a3fac2dfe merge #7664: [archivedmoe] redirect URL fixes (#7652) 2025-06-15 10:03:34 +02:00
Mike Fährmann
b245218c1d [archivedmoe] reword some comments and variable names 2025-06-15 10:00:45 +02:00
hunter-gatherer8
96f5cfb305 [girlswithmuscle] add support (#4493 #6016)
* [girlswithmuscle] init
* [girlswithmuscle]: fix metadata extraction (site layout change)
* [girlswithmuscle]: fix tags extraction (site layout change)
* update login code
* update 'post' extractor
* update 'gallery' extractor, rename to 'search' extractor
* update docs
* add test cases

---------

Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-06-14 23:05:49 +02:00
Mike Fährmann
b583891df6 [pixiv] remove '/jump.php' from *all* AJAX caption links (#4327)
https://github.com/mikf/gallery-dl/issues/4327#issuecomment-2969765775
2025-06-13 21:10:51 +02:00
Mike Fährmann
40dedd7ce0 [util] restore stdlib 're' module in filter expressions (#7665)
fixes regression introduced in 4fc719bb10
2025-06-13 20:52:33 +02:00
Mike Fährmann
fa1fc39a36 [pp:exec] implement 'commands' option
to run multiple commands in succession
and stopping if one fails
2025-06-13 20:19:01 +02:00
NecRaul
6668acf91e [archivedmoe] Sort boards alphabetically 2025-06-13 19:29:47 +04:00
NecRaul
3a4e19d284 [archivedmoe] Simplify board extraction from url 2025-06-13 18:44:02 +04:00
NecRaul
a7aa18a8c1 [archivedmoe] remove unnecessary logging 2025-06-13 18:28:21 +04:00
NecRaul
8b2adeb41e [archivedmoe] simplify board URL redirection logic 2025-06-13 18:26:39 +04:00
NecRaul
05081dea2e Lint with flake8 2025-06-13 17:56:43 +04:00
NecRaul
223fe960a0 [archivedmoe] redirect URL changes (again)
Redirects to warosu.org instead of 4chan's cdn for certain boards
Redirects to archive.4plebs.org instead of 4chan's cdn for /tg/
Slices the filename only if it's redirecting to certain archives
2025-06-13 17:43:16 +04:00
Mike Fährmann
9d3cf67f3e [pinterest] remove excess whitespace from 'description' fields (#4335)
and 'closeup_unified_description' & 'closeup_description'
2025-06-13 13:11:18 +02:00
Mike Fährmann
9afac3c51d [tests:results] rename '#urls' to '#results' 2025-06-13 13:00:50 +02:00
Mike Fährmann
e08ec7e083 update copyright notices 2025-06-13 00:03:41 +02:00
Mike Fährmann
811b665e33 remove @staticmethod decorators
There might have been a time when calling a static method was faster
than a regular method, but that is no longer the case. According to
micro-benchmarks, it is 70% slower in CPython 3.13 and it also makes
executing the code of a class definition slower.
2025-06-12 22:50:52 +02:00
Mike Fährmann
8b6bc54e95 [dl:http] add MIME type and signature for .html files 2025-06-12 21:16:34 +02:00
Mike Fährmann
e41c317039 [fantia] prevent '.html' file downloads 2025-06-12 20:41:47 +02:00
Mike Fährmann
e2d104a110 [twitter] extract 'source_id' and 'source_user' metadata (#7470 #7640) 2025-06-12 18:59:22 +02:00
Mike Fährmann
06e2f2cd91 [twitter] restructure media data extraction 2025-06-12 18:53:15 +02:00
Mike Fährmann
56ea27c474 [blogger] move original/s0 URL code into a separate function 2025-06-12 17:07:56 +02:00
Mike Fährmann
e0a961047c [dl:http] fix potential FileExistsError when '.part' file moved (#5385)
when the downloader detects a '.part' file and attempts to resume its
download, a FileExistsError was raised while trying to create an assumed
missing directory path when said '.part' file was (re)moved during the
time it was detected and the downloader attempted to open it in 'a+b' mode.
2025-06-12 16:44:41 +02:00
Mike Fährmann
5b60c66e0f [path] support multiple characters in path-restrict objects (#1707) 2025-06-12 15:29:13 +02:00
Mike Fährmann
16fc5e0d68 [batoto] fix downloading manga with alerts/notices (#7657)
and improve alert message extraction
2025-06-12 08:26:26 +02:00
Mike Fährmann
847fce6ef1 [tests:results] implement testing for logging messages 2025-06-12 08:22:28 +02:00
Mike Fährmann
a14671992c [sexcom] prevent '.css' file downloads (#7632)
by detecting homepage redirects
and improve redirect handling in general
2025-06-11 22:32:08 +02:00
Mike Fährmann
0df083b208 [vk] prevent '404 Not Found' errors for file downloads
only strip query parameters when regex substitution applies
2025-06-11 22:32:08 +02:00
Mike Fährmann
d065452ba3 merge #7653: [archivedmoe] fix redirection issue (#7652) 2025-06-11 20:04:42 +02:00
Mike Fährmann
80599fa610 [vk] fix 'user' metadata extraction
add boolean 'group' field
2025-06-11 20:01:27 +02:00
NecRaul
e3df99dbb9 Apply mikf's diff regarding Archived.moe
Moved (and refactored) code into remote()
Added a check for fixup_timestamp
2025-06-11 21:51:03 +04:00
Mike Fährmann
85931185a6 [vk] add continuation message (#7650) 2025-06-11 18:07:39 +02:00
Mike Fährmann
8287a1b372 [vk] detect redirects to 'challenge' pages (#7650) 2025-06-11 18:02:14 +02:00
Mike Fährmann
8645be8244 [archivedmoe] add redirect URL fixup test 2025-06-11 12:10:18 +02:00
NecRaul
4370654532 Simplify remote_media_link assignment 2025-06-11 04:49:21 +04:00
NecRaul
cb74d0f2f3 Lint with flake8 2025-06-11 04:46:18 +04:00
NecRaul
96bb2b1630 Fix Archived.moe redirection issue
Unless the board is /b/ (in which case redirection works fine),
remove the characters of the filename portion of the url until
filename portion of the url is 13 characters long (epoch millis).
2025-06-11 04:42:03 +04:00
Mike Fährmann
9348c9ce25 [workflows:executables] update to 'windows-2022'
'windows-2019' will no longer be supported by 2025-06-30
https://github.com/actions/runner-images/issues/12045
2025-06-10 22:59:18 +02:00
Mike Fährmann
b4aed5e2c9 [common] allow overriding 'user-agent' when 'browser' is used (#7647) 2025-06-10 22:05:28 +02:00
Mike Fährmann
8e698d1a64 [ytdl] set domain as subcategory when using Generic extractor (#6582)
https://github.com/mikf/gallery-dl/issues/6582#issuecomment-2959879730
2025-06-10 21:35:15 +02:00
Mike Fährmann
4cfddc144a [common] import 'datetime' class directly 2025-06-10 21:35:15 +02:00
Mike Fährmann
e68555defa [common] improve cookie-related logging messages 2025-06-10 21:34:27 +02:00
Mike Fährmann
511cf2363c [common] update expired cookie messages (#7644)
- prefix with 'cookies:'
- include domain
- include exact time when it expired
2025-06-09 18:48:04 +02:00
Mike Fährmann
7a48b696a6 [dl] improve maximum 'rate' detection (#7638) 2025-06-09 12:15:39 +02:00