Commit Graph

1427 Commits

Author SHA1 Message Date
Mike Fährmann
bcfce6b7db [dl:http] improve HTML signature check (#7697)
https://github.com/mikf/gallery-dl/issues/7697#issuecomment-2990734451

ignore leading whitespace
2025-06-20 14:39:32 +02:00
Mike Fährmann
e8d28d3d0d merge #7699: [warosu] handle missing images by checking hostname (#7698) 2025-06-20 13:17:44 +02:00
Mike Fährmann
bb73c8318c [warosu] add test for non-archived post 2025-06-20 12:38:54 +02:00
Mike Fährmann
4279928d0b [gelbooru_v02] extract 'total' / 'search_count' metadata (#7689) 2025-06-19 19:15:27 +02:00
Mike Fährmann
6d7ca4890f [tests] add missing '\n' to stdout.write() output 2025-06-19 11:39:18 +02:00
Mike Fährmann
4f7a0e5d02 [arcalive] fix download URLs (#7678)
https://github.com/danbooru/danbooru/pull/6041
2025-06-18 21:12:18 +02:00
Mike Fährmann
8f79ec67f4 [text] add 'build_query()' 2025-06-18 20:49:12 +02:00
Mike Fährmann
41191bb60a 'match.group(N)' -> 'match[N]' (#7671)
2.5x faster
2025-06-18 13:05:58 +02:00
Mike Fährmann
c73d902e13 [tests/results] include '#comment' in test output 2025-06-17 22:32:08 +02:00
Mike Fährmann
f7b1b6b5c8 [tests] replace 'print' with 'sys.stdout.write' 2025-06-17 22:05:41 +02:00
Mike Fährmann
dd299b3dad [tests/util] add LazyPrompt test 2025-06-17 21:47:21 +02:00
Mike Fährmann
acd5cb0cf0 [tests/util] add HTTPBasicAuth tests 2025-06-17 20:28:16 +02:00
Mike Fährmann
f1343894cb [tests/util] move 'datetime' tests into a separate TestCase class 2025-06-17 20:14:43 +02:00
pocketinternet
3ea244eebb [webtoons] add 'thumbnails' option (#6468 #7441)
* Update webtoons.py
    Added thumbnail download capability which defaults to false
* Update configuration.rst
    Added documentation for webtoon thumbnail option
* extract thumbnails in GalleryExtractor.assets()
* simplify & fix flake8
* include 'type' in default filenames
* add test
* update docs

---------

Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-06-17 19:34:58 +02:00
Mike Fährmann
6d928f3805 remove some pre-3.8 workarounds (#7671) 2025-06-17 12:56:47 +02:00
SpiffyChatterbox
e0f65be36b [nudostar] add support (#5735 #6556)
* Drafting initial basic extractor layout
* Better debug logging
* Update nudostar.py
    Still tinkering
* Update nudostar.py
    Basic extractor is working. Now starting on Gallery
* Update nudostar.py
    Still a work in progress.
    Got individual posts working, galleries are not.
* Update nudostar.py
* Site now appears working. Added Tests.
* PEP Updates
* PEP - Line Length Updates
* Update nudostar.py
    Resolving PEP8 issues.
* update 'gallery' extractor, rename to 'model'
* update 'image' extractor
* expand tests
* update docs/supportedsites

---------

Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-06-16 19:21:49 +02:00
missionfloyd
13cb031fe2 [girlsreleased] add support (#6200)
* [girlsreleased] add support
* Lint
* Change "galleries" to "sets"
    As it is on the site
* Add pagination
    Add tests
* Fix tests
* Remove leftover print()
* Don't remove first set
* Yield pages
* Add filename metadata
* [girlsreleased] Refactor
* Return models as array
* Add filename numbering
    Add date metadata
* Add URL metadata
* Spawn set extractor the right way
* Adjust model/site regex
* update
    - restructure some code
    - remove constructors
    - use f-strings
* expand tests
* update docs/supportedsites
2025-06-16 19:18:19 +02:00
SpiffyChatterbox
48ac41605d [redbust] add support (#6759 #6918 #7043)
* init - Redbust.com Support
* Added Test
    Could use a second set of eyes on this
* update 'gallery' extractor
    - extract more metadata
    - simplify image extraction
    - support legacy galleries
* add tests
* update 'image' extractor
* add 'tag' extractor
* add 'archive' extractor
* restrict 'image' extractor pattern
* update docs/supportedsites
* replace quotes inside f-string

---------

Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-06-16 12:10:42 +02:00
hunter-gatherer8
96f5cfb305 [girlswithmuscle] add support (#4493 #6016)
* [girlswithmuscle] init
* [girlswithmuscle]: fix metadata extraction (site layout change)
* [girlswithmuscle]: fix tags extraction (site layout change)
* update login code
* update 'post' extractor
* update 'gallery' extractor, rename to 'search' extractor
* update docs
* add test cases

---------

Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-06-14 23:05:49 +02:00
Mike Fährmann
40dedd7ce0 [util] restore stdlib 're' module in filter expressions (#7665)
fixes regression introduced in 4fc719bb10
2025-06-13 20:52:33 +02:00
Mike Fährmann
fa1fc39a36 [pp:exec] implement 'commands' option
to run multiple commands in succession
and stopping if one fails
2025-06-13 20:19:01 +02:00
Mike Fährmann
9d3cf67f3e [pinterest] remove excess whitespace from 'description' fields (#4335)
and 'closeup_unified_description' & 'closeup_description'
2025-06-13 13:11:18 +02:00
Mike Fährmann
9afac3c51d [tests:results] rename '#urls' to '#results' 2025-06-13 13:00:50 +02:00
Mike Fährmann
e08ec7e083 update copyright notices 2025-06-13 00:03:41 +02:00
Mike Fährmann
811b665e33 remove @staticmethod decorators
There might have been a time when calling a static method was faster
than a regular method, but that is no longer the case. According to
micro-benchmarks, it is 70% slower in CPython 3.13 and it also makes
executing the code of a class definition slower.
2025-06-12 22:50:52 +02:00
Mike Fährmann
8b6bc54e95 [dl:http] add MIME type and signature for .html files 2025-06-12 21:16:34 +02:00
Mike Fährmann
e2d104a110 [twitter] extract 'source_id' and 'source_user' metadata (#7470 #7640) 2025-06-12 18:59:22 +02:00
Mike Fährmann
16fc5e0d68 [batoto] fix downloading manga with alerts/notices (#7657)
and improve alert message extraction
2025-06-12 08:26:26 +02:00
Mike Fährmann
847fce6ef1 [tests:results] implement testing for logging messages 2025-06-12 08:22:28 +02:00
Mike Fährmann
d065452ba3 merge #7653: [archivedmoe] fix redirection issue (#7652) 2025-06-11 20:04:42 +02:00
Mike Fährmann
80599fa610 [vk] fix 'user' metadata extraction
add boolean 'group' field
2025-06-11 20:01:27 +02:00
Mike Fährmann
8645be8244 [archivedmoe] add redirect URL fixup test 2025-06-11 12:10:18 +02:00
Mike Fährmann
8e698d1a64 [ytdl] set domain as subcategory when using Generic extractor (#6582)
https://github.com/mikf/gallery-dl/issues/6582#issuecomment-2959879730
2025-06-10 21:35:15 +02:00
Mike Fährmann
e68555defa [common] improve cookie-related logging messages 2025-06-10 21:34:27 +02:00
Mike Fährmann
a25e14e776 [dl:http] implement dynamic download 'rate' limits (#7638) 2025-06-08 20:04:31 +02:00
Mike Fährmann
e84df260c0 [util] generalize 'build_duration_func' 2025-06-08 20:01:16 +02:00
Mike Fährmann
827eeca0bc [paheal] fix '404 Not Found' for tags with URL encoded characters (#7642) 2025-06-08 16:23:11 +02:00
Mike Fährmann
b95b71f63f [tests:results] implement general tests for list items 2025-06-07 10:36:43 +02:00
Mike Fährmann
fc01d85823 [tests:results] split '_test_kwdict()' 2025-06-07 10:16:47 +02:00
Mike Fährmann
6e120f2551 [danbooru] fix Ugoira for instances without 'Ugoira:FrameMimeType'
(#7630)

fixes regression introduced in 1866f8b97b
2025-06-07 07:47:03 +02:00
Mike Fährmann
3e423937d2 [misskey] implement 'include' option (#5347) 2025-06-06 20:52:03 +02:00
Mike Fährmann
ac09cac978 [misskey] add 'avatar' and 'background' extractors (#5347) 2025-06-06 20:14:05 +02:00
Mike Fährmann
9c4cef822e [komikcast] update domain to 'komikcast02.com' 2025-06-06 20:14:02 +02:00
Mike Fährmann
b5c88b3d3e replace standard library 're' uses with 'util.re()' 2025-06-06 13:24:52 +02:00
Mike Fährmann
e1f03a5a93 [util] allow 'noop()' to accept one optional argument
fixes 'NullResponse.close()'
2025-06-05 20:08:24 +02:00
Mike Fährmann
685836f6fd [dynastyscans] add 'anthology' extractor (#7627) 2025-06-04 21:23:49 +02:00
Mike Fährmann
b5334f5837 [everia] prevent redirect when fetching a post page 2025-06-04 11:09:40 +02:00
missionfloyd
72e1a4a0cb [everia] unquote URLs (#7620)
* [everia.club] unescape URLs

* add test
2025-06-04 09:38:06 +02:00
Mike Fährmann
3c6c40d4ed [nijie] fix file extraction (#7624)
ignore empty URLs / URLs with no 'src="'
2025-06-04 07:57:27 +02:00
Mike Fährmann
4fc719bb10 [util] add 're' helper 2025-06-04 07:56:43 +02:00