Commit Graph

5110 Commits

Author SHA1 Message Date
Mike Fährmann
aa6c2dcbac [behance] provide 'creator[name]' metadata (#7885) 2025-07-24 15:30:51 +02:00
Mike Fährmann
849a5b191f [behance] use '"browser": "firefox"' by default (#7803 #7877) 2025-07-24 09:31:12 +02:00
Mike Fährmann
f39c8b125f [4chan] detect files containing only null bytes (#7883) 2025-07-24 09:11:52 +02:00
Mike Fährmann
b2a354f0d2 [urlgalleries] do not strip query parameters (#7888)
fixes regression introduced in fc06243b89
2025-07-24 08:46:59 +02:00
Farahat
cf2e5a1619 [leakgallery] add support (#7872)
* add new extractor for leakgallery.com

    Added support for downloading photo and video posts from leakgallery.com.

    Supports:
    * Individual post URLs
    * User profile URLs with pagination via AJAX
    * Optional type/sort filters (e.g. /Photos/MostRecent)
    * Proper file extension handling
    * Creator-based folder structure
    * Compatibility with --download-archive

    Tested locally and functional, but may still need review or improvement.
    
* [leakgallery] add support
    Added leakgallery to extractor module imports so it's recognized and used.
* [leakgallery] update extractor structure
    - Refactored using LeakGalleryExtractorBase to remove duplication
    - Moved init logic into items() using self.groups
    - Replaced re with text.re as per upstream guidance
    - Added creator fallback and media deduplication
    - Aligned structure with gallery-dl maintainer review tips
* [leakgallery] add support
    - Added leakgallery entry to supportedsites.md
    - Includes post, user, trending, and most-liked subcategories
* add exported extractor results
* [leakgallery] fix flake8 style issues
    Cleaned up code to comply with flake8 rules, especially:
    - removed unused imports
    - split long lines >79 chars
    - ensured newline at EOF
    No functional changes made; purely formatting to satisfy CI checks.
* [tests] update extractor results
* [leakgallery] fix flake8 style issues (part 2)
    Fix remaining flake8 issues in leakgallery.py:
    - Reformat line breaks to avoid W503 (line break before binary operator)
    - Wrap long lines to respect E501 (line too long > 79 characters)
    - Cleaned up exception logging for better clarity
    - Confirmed all flake8 checks now pass successfully
    This superseedes the previous commit which partially fixed formatting violations.
* [leakgallery] fix flake8 style issues (part 3)
* [leakgallery] rename extractor classes
* [tests] update extractor results
* [tests] rename extractor results
* [leakgallery] rename extractor classes (part 2)
* [leakgallery] rename example
* update docs/supportedsites
* update test results
    and convert line endings to '\n'
* update
    - convert line endings to '\n'
    - use _pagination method
    - fix logging calls
* return more metadata for _pagination() results
2025-07-22 22:50:25 +02:00
Mike Fährmann
a097a373a9 simplify if statements by using walrus operators (#7671) 2025-07-22 20:57:54 +02:00
Mike Fährmann
a08f4aa1c0 [nhentai] provide 'gallery_id' for pagination results (#7868) 2025-07-22 20:57:54 +02:00
Mike Fährmann
f0de034889 [exception] rename 'LoginRequired' to 'AuthRequired' 2025-07-22 20:57:54 +02:00
Mike Fährmann
e0599d35dc [civitai] fix 'AttributeError' when a file's post was deleted (#7860) 2025-07-20 20:02:09 +02:00
Mike Fährmann
fc06243b89 [urlgalleries] fix extraction (#7858) 2025-07-20 19:55:33 +02:00
Mike Fährmann
923e27f4fd [civitai] fix 'extension' for videos without 'name' and 'mimeType'
use _video_ext instead of _image_ext
2025-07-19 18:30:29 +02:00
Mike Fährmann
0a245e8d80 [civitai] improve '_extract_version_id()'
also check for 'modelVersionIdsManual'
2025-07-19 18:30:29 +02:00
Mike Fährmann
272df323f3 [civitai] ensure 'file' & 'post' data has a 'date' value (#7548)
https://github.com/mikf/gallery-dl/issues/7548#issuecomment-3091492518
2025-07-19 18:30:29 +02:00
Mike Fährmann
0b991148a1 [civitai] rename 'generate' to 'generated' (#7796) 2025-07-19 18:30:29 +02:00
Luca Russo
869a644e07 [facebook] add retries to profile page request (#7725 #7834 #7852) 2025-07-19 10:18:39 +02:00
turaiel
7464a774f6 [inkbunny] enable pool metadata (#7850) 2025-07-19 09:36:26 +02:00
Mike Fährmann
b0a33d402d [sexcom] update 'search' extractor (#7807) 2025-07-18 21:02:32 +02:00
Mike Fährmann
22ec1aad9a [pornhub] fix 'gallery' extractor (#7842)
- update API endpoint
- extract and send 'token' parameter
2025-07-18 19:08:55 +02:00
Mike Fährmann
67a4472bc2 [civitai] add 'generate' extractor (#7796) 2025-07-18 18:34:17 +02:00
Mike Fährmann
fee14e0055 [madokami] improve 'chapter_string' extraction (#7828) 2025-07-17 22:12:30 +02:00
Mike Fährmann
1561284815 [madokami] add 'manga' extractor (#7828) 2025-07-17 20:40:26 +02:00
Mike Fährmann
629db56b12 [common] detect CF & DG challenges in request_json & _xml (#7833)
even when called when fatal=False
2025-07-17 17:17:01 +02:00
Mike Fährmann
64d6e8c8ff [azurelanewiki] prevent Anubis challenge 2025-07-17 16:37:30 +02:00
Mike Fährmann
87cf635f6e [wikimedia] add missing 'self' argument when calling 'prepare()' (#7835)
fixes regression introduced in 811b665e33
2025-07-17 16:30:36 +02:00
Mike Fährmann
b66a476cab [komikcast] update domain to 'komikcast.li' 2025-07-15 08:12:55 +02:00
Mike Fährmann
bbe7faed21 [iwara] update
- add 'include' docs
- simplify code for different result types
- provide 'count' and 'num' metadata
- remove 'search_type' metadata
2025-07-14 18:47:14 +02:00
Mike Fährmann
fca1cd51f5 [gelbooru] improve error for 401 responses (#7674) 2025-07-14 16:54:21 +02:00
NecRaul
089f815560 [motherless] Add group support (#7774 #7787)
* [motherless] Add group support
* [motherless] Update old unit tests
* [motherless] Add new unit tests
* [motherless] Update docs
2025-07-14 16:50:07 +02:00
Appu
75582e38e9 [VSCO] Fix JSON returned by VSCO (#7821)
* [VSCO] Fix JSON returned by VSCO

The JSON returned contains multiple `undefined` values that aren't valid
JSON, changing these values to `null` fixes the issue.

* [VSCO] Make JSON fix more targeted

Change only occurrences that are values of a key-value
2025-07-14 15:29:22 +02:00
NecRaul
a7ebb835ea [iwara] Add support (#2652 #5840 #7785)
* [iwara] Add initial support
* [iwara] Add search support
* [iwara] Code cleanup
* [iwara] Small fixes and additions
* [iwara] Add tag support
* [iwara] Add mime-type to metadata
* [iwara] Refactor patterns/matching using urllib
* [iwara] Add unit tests
* [iwara] Update docs
* [iwara] Fix linting on older Python versions
* [iwara] update 'IwaraAPI' interface class
    - define endpoints inside methods
    - implement and use _call() and _pagination()
    - cache auth tokens
* [iwara] split and rename 'profile' extractor
    TODO:
    - update test results
    - simplify code
* [iwara] simplify '_user_params()' usage
* [iwara] update 'video' extractor
    and move user data extraction into 'yield_video'
* [iwara] update 'image' extractor
    and move user info extraction into 'yield_image()'
* [iwara] update 'playlist' extractor
* [iwara] update 'search' extractor
* [iwara] update 'tag' extractor
* [iwara] simplify 'yield_image' usage
    perform API calls to get full 'files' list inside the function
* [iwara] add video "image" test
* [iwara] provide 'date' metadata
* [iwara] simplify 'source()'
    remove urllib.parse usage
* [iwara] small optimizations
    * get("key", {}) -> get("key") or {}
    * split("…", 1) -> partition("…")
    * use f-strings for all patterns
* [iwara] add missing 'keyarg=1' to profile() memcache decorator
* [tests/iwara] update results
* [iwara] extract more 'user' metadata
* [iwara] update default format strings
    include 'date' in filenames to order them chronologically
* [iwara] restructure image/video handling
    - use less generators
    - make processing individual media items non-fatal
* [iwara] fix login and token handling
* [iwara] add 'favorite' extractor
* [iwara] add 'following' and 'followers' extractors

---------

Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-07-13 21:30:25 +02:00
Mike Fährmann
fc968ebf20 [artstation] support downloading '.mview' files (#7812) 2025-07-12 20:53:16 +02:00
Mike Fährmann
518c771e21 [instagram] provide 'post_url' for stories and highlights (#7810)
fixes regression introduced in 25c63deebf
2025-07-12 17:30:06 +02:00
Mike Fährmann
2d3b6155eb [erome] restructure extractor hierarchy (#7804) 2025-07-11 22:28:26 +02:00
Mike Fährmann
2ccb9acf1a [pinterest] support 'pin.it' board redirects (#7805) 2025-07-11 22:28:26 +02:00
Mike Fährmann
8e40ea2fe2 [pinterest] match board URLs with query strings (#7805) 2025-07-11 22:28:26 +02:00
Mike Fährmann
e68dc960c2 [pixiv] add workaround for 'novel-*' includes (#7746) 2025-07-11 22:27:45 +02:00
Mike Fährmann
267661680c add 'pixiv-novel' compat categories (#7746) 2025-07-11 11:00:19 +02:00
Mike Fährmann
096bc6f784 replace more '.format(…)' calls with f-strings (#7671) 2025-07-10 21:17:17 +02:00
Mike Fährmann
a619638178 [mangadex] allow 'ratings' to be a (comma-separated) string (#7799) 2025-07-10 16:46:18 +02:00
Mike Fährmann
25c63deebf [instagram] provide better 'video_dash_manifest' URLs (#7631) 2025-07-10 15:34:50 +02:00
Mike Fährmann
d8ef1d693f rename 'StopExtraction' to 'AbortExtraction'
for cases where StopExtraction was used to report errors
2025-07-09 21:07:28 +02:00
Mike Fährmann
f17ed0569a [common] raise ChallengeError for CF & DDG challenges (#1945)
this sets bit 0x08 in the exit status bitmask
2025-07-09 18:58:21 +02:00
Mike Fährmann
b77e18317b [skeb] fix "KeyError - 'frame_rate'" (#7798) 2025-07-09 12:40:37 +02:00
Mike Fährmann
cfafbc0675 [twitter] extract 'sensitive_flags' metadata (#2523)
a list of 'sensitive_media_warning' flags per file
and a combination of all file flags per Tweet
2025-07-09 12:39:23 +02:00
Mike Fährmann
321141ba56 merge #7789: [readcomiconline] fix extraction (#7606) 2025-07-07 12:15:16 +02:00
Mike Fährmann
ad84e72fda [civitai] implement retrieving 'post' metadata (#7548) 2025-07-07 11:48:21 +02:00
Mike Fährmann
d25b1f22f3 [civitai] always provide 'file[…]' metadata (#7548)
wrap most top-level values of (single) image results into a 'file' object
2025-07-07 11:45:49 +02:00
Mike Fährmann
8452a018ac [civitai] fix '403 Forbidden' errors for searches
update Authorization header
2025-07-07 10:20:19 +02:00
Mike Fährmann
1aa8fe7c10 [e621] support 'e621.cc/posts' URLs (#6809)
https://github.com/mikf/gallery-dl/issues/6809#issuecomment-3043180010
2025-07-07 08:44:31 +02:00
enduser420
8c1628ea4e [readcomiconline] fix extraction 2025-07-07 02:32:23 +05:30