Mike Fährmann
bcfce6b7db
[dl:http] improve HTML signature check (#7697)
https://github.com/mikf/gallery-dl/issues/7697#issuecomment-2990734451
ignore leading whitespace
2025-06-20 14:39:32 +02:00
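
The idea of a signature check that ignores leading whitespace can be sketched as follows. This is a minimal illustration, not gallery-dl's actual code: the function name `looks_like_html`, the 256-byte window, and the two signatures tested are assumptions made for the example.

```python
def looks_like_html(data: bytes) -> bool:
    """Guess whether a downloaded payload is an HTML page.

    Hypothetical helper: strips leading ASCII whitespace from the first
    bytes of the payload, then compares against two common HTML openers.
    """
    head = data[:256].lstrip().lower()
    return head.startswith((b"<!doctype html", b"<html"))


# An HTML error page with leading whitespace, and the start of a PNG file.
html_result = looks_like_html(b"\n\t  <!DOCTYPE html><html><body>404</body></html>")
png_result = looks_like_html(b"\x89PNG\r\n\x1a\n\x00\x00\x00\rIHDR")
```

Without the `lstrip()` call, the first payload would not be recognized, which is the case the commit subject describes.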
Mike Fährmann
dba45923c8
[dl:http] fail downloads with HTML content (#7697)
add 'validate-html' option
2025-06-20 14:34:40 +02:00
Mike Fährmann
e8d28d3d0d
merge #7699: [warosu] handle missing images by checking hostname (#7698)
2025-06-20 13:17:44 +02:00
Mike Fährmann
bb73c8318c
[warosu] add test for non-archived post
2025-06-20 12:38:54 +02:00
NecRaul
5ba7c98bc2
[warosu] Simpler/less costly hostname check
2025-06-20 14:27:21 +04:00
NecRaul
3c85032b9b
[warosu] Handle missing images in the extractor by checking hostname
2025-06-20 09:34:49 +04:00
Mike Fährmann
4279928d0b
[gelbooru_v02] extract 'total' / 'search_count' metadata (#7689)
2025-06-19 19:15:27 +02:00
Mike Fährmann
fcd1b8a155
[common] add a 'kwdict' member to extractor instances
to allow setting general metadata at any point and without having to
rely on a manually implemented 'metadata()' method
2025-06-19 19:08:35 +02:00
Mike Fährmann
a80d55d974
[common] improve 'user-agent' override logic
Prevent a general 'user-agent' setting, i.e. extractor.user-agent, from
overriding the User-Agent header set by an extractor's 'browser'
option, default or otherwise, meaning only extractor-level or top-level
'user-agent' settings will override the 'browser' User-Agent.
https://github.com/mikf/gallery-dl/issues/7382#issuecomment-2985296321
2025-06-19 18:58:39 +02:00
Mike Fährmann
6d7ca4890f
[tests] add missing '\n' to stdout.write() output
2025-06-19 11:39:18 +02:00
Mike Fährmann
4f7a0e5d02
[arcalive] fix download URLs (#7678)
https://github.com/danbooru/danbooru/pull/6041
2025-06-18 21:12:18 +02:00
Mike Fährmann
8f79ec67f4
[text] add 'build_query()'
2025-06-18 20:49:12 +02:00
Mike Fährmann
b0580aba86
update 'match.lastindex' usage
2025-06-18 20:24:13 +02:00
Mike Fährmann
41191bb60a
'match.group(N)' -> 'match[N]' (#7671)
2.5x faster
2025-06-18 13:05:58 +02:00
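
The two spellings named in the commit above are equivalent in Python 3.6 and later, where `re.Match` objects support indexing. A small sketch, with a pattern and input string invented for illustration (the speed figure is the commit author's claim and is not measured here):

```python
import re

# Hypothetical pattern and input, chosen only for this example.
pattern = re.compile(r"(\w+)-(\d+)")
match = pattern.match("post-12345")
assert match is not None

# Method-call form and subscript form return the same groups.
by_method = (match.group(0), match.group(1), match.group(2))
by_index = (match[0], match[1], match[2])
```

The subscript form avoids an attribute lookup and a method call per access, which is presumably where the reported speedup comes from.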
Mike Fährmann
475506cc39
[util] more micro optimizations
2025-06-18 12:27:18 +02:00
Mike Fährmann
c73d902e13
[tests/results] include '#comment' in test output
2025-06-17 22:32:08 +02:00
Mike Fährmann
f7b1b6b5c8
[tests] replace 'print' with 'sys.stdout.write'
2025-06-17 22:05:41 +02:00
Mike Fährmann
dd299b3dad
[tests/util] add LazyPrompt test
2025-06-17 21:47:21 +02:00
Mike Fährmann
acd5cb0cf0
[tests/util] add HTTPBasicAuth tests
2025-06-17 20:28:16 +02:00
Mike Fährmann
f1343894cb
[tests/util] move 'datetime' tests into a separate TestCase class
2025-06-17 20:14:43 +02:00
pocketinternet
3ea244eebb
[webtoons] add 'thumbnails' option (#6468 #7441)
* Update webtoons.py
Added thumbnail download capability which defaults to false
* Update configuration.rst
Added documentation for webtoon thumbnail option
* extract thumbnails in GalleryExtractor.assets()
* simplify & fix flake8
* include 'type' in default filenames
* add test
* update docs
---------
Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-06-17 19:34:58 +02:00
Mike Fährmann
1f429da650
[scripts/options] make output width independent of terminal size
2025-06-17 18:52:46 +02:00
Mike Fährmann
c88d69376f
[util] update code to 3.8
2025-06-17 17:06:00 +02:00
Mike Fährmann
6d928f3805
remove some pre-3.8 workarounds (#7671)
2025-06-17 12:56:47 +02:00
Mike Fährmann
d0b1da0fa7
[tiktok] handle exceptions when extracting avatars (#7682)
2025-06-17 12:21:20 +02:00
Mike Fährmann
eaed644b85
[build] update setup.py
- increase required Python version to 3.8 (#7671)
- remove 3.4-3.7 classifiers
- add 'extra' optional dependency group
2025-06-17 11:48:54 +02:00
Mike Fährmann
efa2ab5903
[common] allow GalleryExtractors to return additional asset files
like thumbnails or covers
2025-06-16 22:45:52 +02:00
SpiffyChatterbox
e0f65be36b
[nudostar] add support (#5735 #6556)
* Drafting initial basic extractor layout
* Better debug logging
* Update nudostar.py
Still tinkering
* Update nudostar.py
Basic extractor is working. Now starting on Gallery
* Update nudostar.py
Still a work in progress.
Got individual posts working, galleries are not.
* Update nudostar.py
* Site now appears working. Added Tests.
* PEP Updates
* PEP - Line Length Updates
* Update nudostar.py
Resolving PEP8 issues.
* update 'gallery' extractor, rename to 'model'
* update 'image' extractor
* expand tests
* update docs/supportedsites
---------
Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-06-16 19:21:49 +02:00
missionfloyd
13cb031fe2
[girlsreleased] add support (#6200)
* [girlsreleased] add support
* Lint
* Change "galleries" to "sets"
As it is on the site
* Add pagination
Add tests
* Fix tests
* Remove leftover print()
* Don't remove first set
* Yield pages
* Add filename metadata
* [girlsreleased] Refactor
* Return models as array
* Add filename numbering
Add date metadata
* Add URL metadata
* Spawn set extractor the right way
* Adjust model/site regex
* update
- restructure some code
- remove constructors
- use f-strings
* expand tests
* update docs/supportedsites
2025-06-16 19:18:19 +02:00
SpiffyChatterbox
48ac41605d
[redbust] add support (#6759 #6918 #7043)
* init - Redbust.com Support
* Added Test
Could use a second set of eyes on this
* update 'gallery' extractor
- extract more metadata
- simplify image extraction
- support legacy galleries
* add tests
* update 'image' extractor
* add 'tag' extractor
* add 'archive' extractor
* restrict 'image' extractor pattern
* update docs/supportedsites
* replace quotes inside f-string
---------
Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-06-16 12:10:42 +02:00
Mike Fährmann
6227b64de2
[workflows:tests] include Python 3.7
2025-06-16 12:03:55 +02:00
Mike Fährmann
d9432ee297
[warosu] restore correct 'now' values
2025-06-16 12:00:59 +02:00
Mike Fährmann
489eccb659
merge #7677: [warosu] HTML attribute fix (#7676)
2025-06-16 11:51:08 +02:00
NecRaul
3bc6bc7c77
[warosu] Single quotes when string has a quotation mark
2025-06-16 12:50:20 +04:00
NecRaul
f56e810f42
[warosu] Attribute fix
2025-06-16 12:33:16 +04:00
Mike Fährmann
0a3fac2dfe
merge #7664: [archivedmoe] redirect URL fixes (#7652)
2025-06-15 10:03:34 +02:00
Mike Fährmann
b245218c1d
[archivedmoe] reword some comments and variable names
2025-06-15 10:00:45 +02:00
hunter-gatherer8
96f5cfb305
[girlswithmuscle] add support (#4493 #6016)
* [girlswithmuscle] init
* [girlswithmuscle]: fix metadata extraction (site layout change)
* [girlswithmuscle]: fix tags extraction (site layout change)
* update login code
* update 'post' extractor
* update 'gallery' extractor, rename to 'search' extractor
* update docs
* add test cases
---------
Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-06-14 23:05:49 +02:00
Mike Fährmann
b583891df6
[pixiv] remove '/jump.php' from *all* AJAX caption links (#4327)
https://github.com/mikf/gallery-dl/issues/4327#issuecomment-2969765775
2025-06-13 21:10:51 +02:00
Mike Fährmann
40dedd7ce0
[util] restore stdlib 're' module in filter expressions (#7665)
fixes regression introduced in 4fc719bb10
2025-06-13 20:52:33 +02:00
Mike Fährmann
fa1fc39a36
[pp:exec] implement 'commands' option
to run multiple commands in succession
and stop if one fails
2025-06-13 20:19:01 +02:00
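
The behavior described for the 'commands' option, running several commands in order and stopping at the first failure, can be sketched generically with the standard library. This is an illustration of the pattern only, not the post processor's implementation; `run_commands` and the example command list are hypothetical.

```python
import subprocess
import sys


def run_commands(commands):
    """Run each command in order; stop after the first non-zero exit status.

    Returns the exit codes of the commands that were actually run.
    """
    codes = []
    for args in commands:
        code = subprocess.run(args).returncode
        codes.append(code)
        if code != 0:
            break  # later commands are skipped
    return codes


# Three placeholder commands; the second one fails with exit status 3.
commands = [
    [sys.executable, "-c", "pass"],
    [sys.executable, "-c", "import sys; sys.exit(3)"],
    [sys.executable, "-c", "pass"],
]
exit_codes = run_commands(commands)
```

Here the third command never runs, so `exit_codes` holds only two entries.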
NecRaul
6668acf91e
[archivedmoe] Sort boards alphabetically
2025-06-13 19:29:47 +04:00
NecRaul
3a4e19d284
[archivedmoe] Simplify board extraction from url
2025-06-13 18:44:02 +04:00
NecRaul
a7aa18a8c1
[archivedmoe] remove unnecessary logging
2025-06-13 18:28:21 +04:00
NecRaul
8b2adeb41e
[archivedmoe] simplify board URL redirection logic
2025-06-13 18:26:39 +04:00
NecRaul
05081dea2e
Lint with flake8
2025-06-13 17:56:43 +04:00
NecRaul
223fe960a0
[archivedmoe] redirect URL changes (again)
Redirects to warosu.org instead of 4chan's cdn for certain boards
Redirects to archive.4plebs.org instead of 4chan's cdn for /tg/
Slices the filename only if it's redirecting to certain archives
2025-06-13 17:43:16 +04:00
Mike Fährmann
9d3cf67f3e
[pinterest] remove excess whitespace from 'description' fields (#4335)
and 'closeup_unified_description' & 'closeup_description'
2025-06-13 13:11:18 +02:00
Mike Fährmann
9afac3c51d
[tests:results] rename '#urls' to '#results'
2025-06-13 13:00:50 +02:00
Mike Fährmann
e08ec7e083
update copyright notices
2025-06-13 00:03:41 +02:00