Commit Graph

7627 Commits

Author SHA1 Message Date
Mike Fährmann
c745a57bd7 [dankefuerslesen] extend URL pattern (#8811)
- https://danke.moe/read/manga/MANGA/
- https://danke.moe/read/series/MANGA/
- https://danke.moe/reader/manga/MANGA/
- https://danke.moe/reader/series/MANGA/
2026-01-03 13:19:20 +01:00
Mike Fährmann
a28fbbc3c6 [util] rename 'USERAGENT' to 'USERAGENT_GALLERYDL' 2026-01-03 10:31:48 +01:00
Mike Fährmann
e33a63231f [util] adjust Firefox/Chrome UA versions
modify 'ordinal' offsets to get the correct version numbers
2026-01-03 10:31:32 +01:00
Mike Fährmann
02aab26a17 release version 1.31.2 2026-01-02 19:32:54 +01:00
Mike Fährmann
44c2408904 [tiktok] update user ID extraction 2026-01-01 22:28:21 +01:00
Mike Fährmann
2a342e4b1a [tiktok] combine '…_id' fields into a single 'file_id' one (#8804)
add a proper 'file_id' value for 'video' files
2026-01-01 21:49:09 +01:00
Mike Fährmann
fcf6f05a0d [koofr] implement 'zip' option (#6582 #8700)
Download individual files instead of '.zip' bundle by default.

https://github.com/mikf/gallery-dl/issues/6582#issuecomment-3679504391
2026-01-01 19:13:31 +01:00
Benjamin Vergnaud
3b316c857f [bunkr] detect new maintenance video file (#8802)
* fix(bunkr): catch new maintenance video file during validation

Bunkr has performing a large scale maintenance recently, and I've
noticed that I would on occasion download a file only to get
a short server is in maintenance video instead of the intended file.

Debug logs show the filename for the maintenance video has changed:
[urllib3.connectionpool][debug] Starting new HTTPS connection (1): 3d09xl1.b-cdn.net:443
[urllib3.connectionpool][debug] https://3d09xl1.b-cdn.net:443 "GET /c4f36040-bdd1-40b6-aea1-034dfbe88ba2/maint.mp4 HTTP/1.1" 200 322509

This commit fixes the detection of that file to properly send the
"File server in maintenance mode" warning instead of downloading an
unintended file.

Signed-off-by: Benjamin VERGNAUD <ben@bvergnaud.fr>

* check multiple suffixes with one 'endswith()' call

---------

Signed-off-by: Benjamin VERGNAUD <ben@bvergnaud.fr>
2026-01-01 15:53:07 +01:00
Mike Fährmann
5e94bae906 [pixiv] support 'sketch' include (#8789) 2026-01-01 10:49:41 +01:00
Mike Fährmann
4e56be4def [pixiv] provide 'count' metadata (#8794) 2026-01-01 10:15:04 +01:00
Mike Fährmann
b1cae08507 [instagram] don't warn on minor size differences (#8300)
only emit a warning with 'warn-images' if the reported original size
is at least 10% smaller then the size of the returned image
2026-01-01 09:54:07 +01:00
Mike Fährmann
9b6b8deb9e [weibo] fix 'article' include (#8792) 2025-12-31 19:05:11 +01:00
Mike Fährmann
8792d7faea [whyp] add support (#8725) 2025-12-31 18:58:33 +01:00
Duy NGUYEN
2087730b75 [batoto] replace k-subdomain image URLs with n-subdomain (#8791)
* [batoto] replace k-subdomain image URLs with n-subdomain
    Update tests to use battwo.com proxy domain and current metadata
* [batoto] revert to list comprehension
2025-12-31 16:32:03 +01:00
Mike Fährmann
add9fbc13b [manganelo] add 'bookmark' extractor (#8776) 2025-12-31 11:06:08 +01:00
Mike Fährmann
9ff3cc4a8e [xenforo] fix incomplete video URLs (#8786) 2025-12-31 10:48:51 +01:00
Mike Fährmann
2a629fd653 [xenforo] support '/#post-ID' URLs 2025-12-31 10:43:53 +01:00
Mike Fährmann
239de3f092 [civitai] update 'quality-videos' default (#8787)
from "quality=100" to "original=true,quality=100"
to prevent a server timeout / error
2025-12-31 09:56:39 +01:00
Mike Fährmann
c91052f257 [tapas] unescape HTML entities in image URLs (#8790) 2025-12-31 08:47:37 +01:00
Mike Fährmann
51c155df94 [yourlesbians] add 'album' extractor (#8713) 2025-12-30 20:33:50 +01:00
Mike Fährmann
a97c320a38 [sankaku] fix re-authentication (#8779)
Unset the `Authorization` header before performing a re-login
2025-12-30 17:25:24 +01:00
CasualYouTuber31
a6c845bdc8 [tiktok] remove yt-dlp dependency & add support for more post types (#8715)
#7246 #8035 #8466 #8730

* [tiktok] support extracting videos directly without yt-dlp
* [tiktok] support extracting users directly without yt-dlp
* [tiktok] fixing logic, tests, linting errors
* [tiktok] implement tiktok-range support for non-yt-dlp user extractor
* [tiktok] Skip range filter if no ranges are given
* [tiktok] Remove debug code
* [tiktok] only check for faulty device IDs during the first couple of passes
    I think the original yt-dlp solution assumes that if a device ID works once, it will always work.
    Plus, my approach would cause needless retries in certain cases if hasMorePrevious does end up being wrong like the original algorithm accounts for. So let's copy the original algorithm here, too.
* [tiktok] support stories
* [tiktok] you can now extract audio without extracting photos
* [tiktok] add TiktokFollowingExtractor
* [tiktok] update supportedsites to include stories
* [tiktok] Keep tiktok-range option for no content user account test
    It acts as a nice guard against that account suddenly having lots of posts to extract
* [tiktok] TiktokUserExtractor and TiktokFollowingExtractor rewrite
* [tiktok] Fix avatar naming convention to match that of posts
* [tiktok] remove type hints for compatibility with older Python versions
* [tiktok] Improve performance of TiktokFollowingExtractor
    This was largely achieved using the story/batch/item_list endpoint
* [tiktok] Forgot to run flake8
* [tiktok] remove old constant
* [tiktok] Support order-posts config item
* [tiktok] flake8
* [tiktok] Older Python versions don't support match
* [tiktok] always ask for posts in chronological order when in "desc" mode
    We should aim to avoid having pinned posts returned before non-pinned ones
* [tiktok] Add liked posts extraction
* [tiktok] Add reposts extraction
* [tiktok] Add saved posts extraction

* cleanup imports
* remove '# MARK:' comments
* remove & simplify 'except' statements
    KeyboardInterrupt & SystemExit inherit from BaseException (not Exception)
    and therefore don't need special handling
* split 'user' extractor
* move PATTERNs into their respective functions
* use dict comprehensions
* add only-matching test URLs for split user extractors
* update config docs
    rename 'tiktok-user-extractor' to 'ytdl'
* document '"popular"' 'order-posts' value
* inline and remove 'util.chunk()'
2025-12-30 17:17:57 +01:00
Mike Fährmann
c8c4575c7f [dl:http] add MIME type and signature for .aac files 2025-12-29 19:05:34 +01:00
Mike Fährmann
c2917ad5c5 [ahottie] add aupport (#8710) 2025-12-29 18:47:10 +01:00
Mike Fährmann
ec2267244f [tumblr:search] prevent KeyError when using 'offset' pagination (#8720) 2025-12-29 16:57:04 +01:00
Mike Fährmann
ebfd87bbb0 [instagram] detect homepage redirects (#8714) 2025-12-29 16:28:37 +01:00
Mike Fährmann
e12b17649a [dl:ytdl] support '_ytdl_params' 2025-12-28 23:35:21 +01:00
Mike Fährmann
5427e337d4 [dl:ytdl] unset '_mtime_http' (#8767)
prevent an mtime value from a previous HTTP download
from being reused for a file downloaded by 'ytdl'
2025-12-28 22:46:52 +01:00
Mike Fährmann
4f535ec606 [imgpv] add 'image' extractor (#8773) 2025-12-28 21:36:06 +01:00
Oleguer Llopart
bea0e16970 [webtoons] extract 'num_play' & 'num_stop' for background music (#8733 #8755)
* [webtoons] play_image and stop_image in background music
* [webtoons] add play_image and stop_image fields to test results
* [webtoons]: flake8
* [webtoons]: flake8 (2)

* update & improve
    - remove global 'images_urls'
    - move enumerate loop into images() extraction
* simplify 'play_image' etc extraction
* rename 'play_image' etc
    - num_play
    - num_stop
    - filename_play
    - filename_stop

---------

Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-12-27 09:09:10 +01:00
Mike Fährmann
f2efccaadf [instagram:stories] extract correct 'expires' dates (#8764) 2025-12-26 20:27:43 +01:00
Mike Fährmann
05e6340867 [ytdl] expect all exception types when extracting info_dict (#8343)
not only 'YoutubeDLError's
2025-12-26 20:27:04 +01:00
Mike Fährmann
daf6cc2c78 [twitter] add 'notifications' extractor (#7974) 2025-12-26 18:06:00 +01:00
Mike Fährmann
fd6f19347a [actions] fix 'abort' (#8753)
fixes regression introduced in 2eb5e52055

forward the correct arguments to StopExtraction.__init__
instead of logging message and 'extras'
2025-12-26 14:07:48 +01:00
Mike Fährmann
c384cd06c2 [fansly] add 'previews' option (#8686) 2025-12-25 20:25:47 +01:00
Mike Fährmann
01fe6870a3 [fansly] disable 'formats' check by default (#8757)
should allow downloading and selecting the highest resolution video format
regardless of its 'type' / format code
2025-12-25 19:15:21 +01:00
Mike Fährmann
6845661123 [discord] update asset collection & handling (#8589) 2025-12-25 18:22:55 +01:00
Mike Fährmann
57c2327ae3 [tests/results] implement more specific '#auth' checks 2025-12-23 22:17:33 +01:00
Mike Fährmann
9c50c10f1a [deviantart:stash] fix JSONDecodeEerror for folders (#8750)
remove extra ',"submit":{}' after end of JSON object
2025-12-23 20:54:46 +01:00
Mike Fährmann
cc8cd78103 [webtoons] download episode background music (#8733)
add 'bgm' option
2025-12-23 19:08:30 +01:00
Mike Fährmann
2be54be692 [subscribestar] merge 'user-tag' into regular 'user' extractor (#8737) 2025-12-23 18:58:25 +01:00
Mike Fährmann
c3ef760a9f [discord] add 'server-assets' extractor (#8589) 2025-12-22 21:08:33 +01:00
Mike Fährmann
eda6957d4c [imagehosts] remove 'splitext' import 2025-12-22 17:29:29 +01:00
Mike Fährmann
568faee726 [imagehosts] remove '_https' 2025-12-22 11:48:08 +01:00
Mike Fährmann
7669a1f13a [subscribestar:user-tag] update 'pattern' 2025-12-22 11:43:30 +01:00
Mike Fährmann
b5a7540619 Merge branch 'op+': use '+' for 2-element string concatenations 2025-12-22 11:34:21 +01:00
Mike Fährmann
b69e5a7a58 [scripts/init] use 'operator +' when building 'pattern' 2025-12-22 11:31:48 +01:00
Mike Fährmann
00c6821a3f replace 2-element f-strings with simple '+' concatenations
Python's 'ast' module and its 'NodeVisitor' class
were incredibly helpful in identifying these
2025-12-22 11:26:04 +01:00
Mike Fährmann
457c8b2e76 [exhentai] fix possible exception in 'finalize()' (#8741) 2025-12-22 11:22:15 +01:00
Mike Fährmann
609e19273d [subscribestar] add 'user-tag' extractor (#8737) 2025-12-21 22:14:17 +01:00