Amar Paul
b552cdba04
[pholder] add support ( #2568 #9067 )
...
* feat: extractor for pholder.com
Closes #2568
* feat[pholder]: support gallery_id properly and tags
* doc[text.nameext_from_name]: minor typo in docstring
* remove '__init__' & 'request' methods and 'json' import
* use 'text.nameext_from_url' to ensure a 'filename' value
* fix 'imgur' links by disabling auto-Referer
* fix 'data["id"].partition()' call
'partition' returns 3 elements
* use 'item["_source"]' data directly
* remove unused supportedsites overwrite
* catch all exceptions in '_thumb_resolution'
fixes "KeyError: 'width'"
* use 'author' name for user folders
---------
Co-authored-by: Mike Fährmann <mike_faehrmann@web.de >
2026-02-15 09:46:18 +01:00
Mike Fährmann
d2477a94af
[options] add 'sleep-retries' option
2026-02-13 18:04:05 +01:00
Mike Fährmann
e6f2e31d45
[pp:ugoira] disable 'mkvmerge' output, add 'mkvmerge-output' option
2026-02-12 22:10:20 +01:00
Mike Fährmann
0f41f343f4
implement linear/exponential backoff for 'sleep-429'
2026-02-12 19:23:29 +01:00
Mike Fährmann
d491564f8a
[instagram] add 'user-strategy' option ( #8978 #9025 )
2026-02-10 16:46:33 +01:00
Mike Fährmann
a8376f2804
[instagram] add 'user-cache' option ( #8978 #9025 )
2026-02-10 12:01:37 +01:00
Mike Fährmann
935bdb6229
[reddit:user] implement 'only' option ( #8228 )
2026-02-07 21:47:17 +01:00
Mike Fährmann
2d64e76223
[job] implement 'follow' option ( #8752 )
...
Follow and process URLs found in the given format string result.
2026-02-07 21:47:17 +01:00
Mike Fährmann
98ef34a9be
[twitter] support 'article' media ( #8995 )
2026-02-07 21:47:17 +01:00
Mike Fährmann
71680feab9
[actions] implement 'flag … = skip' ( #8960 )
2026-02-06 10:10:38 +01:00
Mike Fährmann
42407afb6d
[xenforo] implement '"order-posts": "reaction"' ( #8997 )
2026-02-04 21:57:30 +01:00
Mike Fährmann
3d36ee0e53
[common] support multiple codes/blocks for '--xff'
...
for example '--xff JP,CN,105.48.0.0/12'
2026-02-04 20:58:59 +01:00
Mike Fährmann
947085c6e3
[common] add '--xff' / 'geo-bypass' option
2026-02-04 18:45:02 +01:00
Mike Fährmann
b329e6e472
[xenforo] add 'attachments' & 'embeds' options
2026-02-04 16:16:55 +01:00
Mike Fährmann
614e1a514a
[twitter] implement 'metadata-user' option ( #8984 )
2026-02-03 20:14:34 +01:00
Mike Fährmann
2cbd9740f2
enable 'metadata-url/-path/-parent/-extractor]' by default
...
assign values to "private" keys to not have them appear in the
standard JSON output, while still making it easier to access them
2026-02-03 19:28:33 +01:00
Mike Fährmann
5ddee423ed
[exhentai][schalenetwork] enable 'tags' categories by default
2026-02-03 10:49:35 +01:00
Mike Fährmann
d232f0d252
[gofile][pixeldrain] enable 'recursive' by default
2026-02-03 09:11:47 +01:00
Mike Fährmann
25d45478d6
[docs/configuration] document postprocessor /MODE and @EVENT ( #8980 )
...
4e9cb428d6
2026-02-02 22:08:10 +01:00
Mike Fährmann
e37fe053d2
[docs/configuration] use anchors for alternative/deprecated names
...
instead of doubled option titles
2026-02-02 19:10:20 +01:00
Mike Fährmann
87323fd71e
rename 'image-*' & 'chapter-*' options to 'file-*' & 'child-*'
...
keep support for both names, e.g. file-filter & image-filter,
while prioritizing the new names
2026-02-02 18:01:45 +01:00
Mike Fährmann
153e589dd0
[twitter] implement '"search-pagination": "date"'
2026-02-02 16:19:48 +01:00
Mike Fährmann
44e18f9b2f
[tsumino] remove module
...
" Tsumino - The End
We're shutting Tsumino down. "
2026-02-01 22:15:06 +01:00
Mike Fährmann
c42a5dce5c
[pp] add 'child' & 'child-after' events
2026-02-01 22:14:14 +01:00
Mike Fährmann
56dcd00391
[job] add 'init' option
2026-02-01 20:48:37 +01:00
bassberry
fd5f5611f6
[tiktok] extract subtitles and all cover types ( #8805 )
...
* Make sure that `img_id`, `audio_id` and `cover_id` fields are always available.
The values are set '' where they are not applicable.
Having `img_id` is necessary for the default `archive_fmt`, the other fields are handled for consistency.
* Allow downloading more than one cover.
The previous behavior is kept as-is, but setting the "covers" option to "all" now grabs all available covers.
* Add support for downloading subtitles
Allows filtering subtitles by source type (ASR, MT) and language.
* Ensure archive uniqueness for covers and subtitles.
* Update the URL test pattern to include the `image` extension.
Although Tiktok may serve the covers with jpeg content, the file ending can be `.image`.
The test before 0c14b164 failed because the asserted URL did not match all cover types, but the now used pattern needs the mentioned file ending.
* Add support for "creator_caption" subtitles in "LC" format.
These subtitles have the keys "Format" set to "creator_caption" and "Source" to "LC".
* Add "LC" (Local Captions) as a subtitle source type in the documentation
* Code deduplication and renaming subtitle metadata
Changed the item type from singular `subtitle` to `subtitles`.
Removed the wrong descriptor `cover` from the subtitles fallback title.
* Refactor subtitle filtering
The filter is now prepared in `_init` to prevent parsing the same config parameter for every item.
The `_extract_subtitles` function will still extract if either filter (source or language) matches.
* Generate a `file_id` for subtitles
Subtitles have multiple fields that determine the unique file, so these are simply concatenated.
This is similar to the cover types, only with more variations.
* Added tests for subtitles
* fix docs entries
* fix '"covers": "all"'
* simplify some code
* Fix fallback title for subtitles
Added the missing "f" to the f-string and added "subtitle" to the title.
The resulting title will look like "TikTok video subtitle #1234567 "
2026-01-30 21:01:06 +01:00
Mike Fährmann
3445c51ca4
[job] add 'output.jsonl' option ( #8953 )
2026-01-30 09:36:28 +01:00
Mike Fährmann
56168fbc87
[weebdex] add 'lang' option, support query params ( #8957 )
...
for example '?order=asc&group=j0fsj3oem3&tlang=en'
2026-01-29 17:01:02 +01:00
Mike Fährmann
eaaa25b6e4
[job] enable all 'parent-…' options for parent extractors by default
...
- parent-directory
- parent-metadata
- parent-session
- parent-skip
- add general 'parent' option
2026-01-27 12:05:19 +01:00
Mike Fährmann
adca123646
[weibo:user] add 'subalbums' include ( #8792 )
2026-01-25 11:16:41 +01:00
Mike Fährmann
e93cfa3348
[twitter] implement '"ratelimit": "abort:N"' ( #5251 #8864 )
2026-01-23 19:54:28 +01:00
Mike Fährmann
f869085476
[weebdex] add 'data-saver' option ( #8914 )
2026-01-23 09:22:41 +01:00
Mike Fährmann
18fabb9605
[batoto] remove module ( #8908 )
...
"Bato.to has shut down."
There are mirror sites, but they are unscrapeable
due to heavily obfuscated HTML and JS
2026-01-21 20:33:08 +01:00
Mike Fährmann
4798ac4836
[common] implement 'parent-session'
2026-01-21 20:33:08 +01:00
Mike Fährmann
63df6423bf
[nitter] use 'gallery-dl/<version>' User-Agent ( #7045 #8130 #8409 )
2026-01-21 18:07:47 +01:00
Mike Fährmann
78da7edde8
[common] add 'googlebot' User-Agent preset
2026-01-21 17:57:26 +01:00
brerk
e00c717b15
[turbo] update 'saint' extractors ( #8893 #8896 )
...
* Implements turbo.py & remove from domain pattern from saints.py
* Remove leftover commented pattern from saints.py
* Make turbo.py comply with flake8
* Add album support
* Improved metadata extracion for albums and single files & created turbo.py tests using saints.py test
* Align turbo.py extractor with flake8 rules
* Fix #class name on turbo.py tests
* Fix #category test
* Fix #category test x2
* Fix #category tests
* Fix #category tests
* Fix TurboMediaExtractor self.groups unpacking
* update basic module formatting
* replace 'saint' with 'turbo' in modules list
* remove saint extractors and tests
* update & simplify 'media' extractor
* update & simplify 'album' extractor
* update tests
* update supportedsites
* update 'category-map' & 'config-map'
---------
Co-authored-by: Mike Fährmann <mike_faehrmann@web.de >
2026-01-19 22:20:13 +01:00
Mike Fährmann
254bb4505c
[chevereto] implement password support
2026-01-16 09:33:55 +01:00
Mike Fährmann
fea4f4c30b
[pixeldrain:album] add 'zip' option
2026-01-14 12:03:52 +01:00
Mike Fährmann
29e8440ec4
[formatter] add 'format-operator-dot' option
...
to allow disabling the behavior introduced in commit
a79a945494
2026-01-13 22:25:37 +01:00
camellia2077
084a6d73e0
[bilibili] add support for Live Photo (video) downloads ( #8860 )
...
* bilibili: add support for live photo downloads
* fix: resolve flake8 linting errors (whitespace and line length)
* fix: resolve flake8 E302 and W293 linting errors
* fix: resolve flake8 W293 and E302 linting errors
* simplify syntax
* add 'livephoto' option
* add tests
2026-01-10 19:27:34 +01:00
Mike Fährmann
7e513f35f2
[wikimedia] use 'gallery-dl/<version>' User-Agent ( #8770 #8861 )
2026-01-10 18:08:13 +01:00
Mike Fährmann
60104f20af
[xenforo] implement 'media' support ( #8785 )
...
add 'media-user' and 'media-item' extractors
TODO: 'media-category' extractor (?)
2026-01-09 21:21:19 +01:00
Mike Fährmann
b1a708beb9
[weibo:album] implement 'subalbum' support ( #8792 )
...
https://weibo.com/u/USER?tabtype=album-1
https://weibo.com/u/USER?tabtype=album-NAME
https://weibo.com/u/USER?tabtype=album-all
https://weibo.com/u/USER?tabtype=album-only
2026-01-07 19:23:54 +01:00
Mike Fährmann
6cc7153b6c
[webtoons] extend 'bgm' option ( #8733 )
...
allow remuxing bgm audio into a different format/container
2026-01-06 21:53:16 +01:00
Mike Fährmann
cf96fc6ebe
[koofr] refactor ( #8803 )
...
https://github.com/mikf/gallery-dl/issues/8803#issuecomment-3708215475
https://github.com/mikf/gallery-dl/issues/8803#issuecomment-3708358606
- add 'recursive' option, remove 'zip'
- recurse into subdirectories
- add 'path' metadata
- remove 'count' & 'num' metadata
- update default directory & archive format
2026-01-06 21:52:58 +01:00
Mike Fährmann
2cfe2b3b04
[common] implement '"user-agent": "+PRESET"'
...
support using a 'User-Agent' header preset, e.g. "+firefox"
2026-01-03 21:07:45 +01:00
Mike Fährmann
fcf6f05a0d
[koofr] implement 'zip' option ( #6582 #8700 )
...
Download individual files instead of '.zip' bundle by default.
https://github.com/mikf/gallery-dl/issues/6582#issuecomment-3679504391
2026-01-01 19:13:31 +01:00
Mike Fährmann
5e94bae906
[pixiv] support 'sketch' include ( #8789 )
2026-01-01 10:49:41 +01:00
Mike Fährmann
239de3f092
[civitai] update 'quality-videos' default ( #8787 )
...
from "quality=100" to "original=true,quality=100"
to prevent a server timeout / error
2025-12-31 09:56:39 +01:00