Commit Graph

1275 Commits

Author SHA1 Message Date
Mike Fährmann
ebf05e53fe [mangapark] support mirror domains 2025-03-25 19:37:26 +01:00
Mike Fährmann
03c87b71ca [nozomi] fix extractors (#7242)
update internal domains to 'gold-usergeneratedcontent.net'
2025-03-25 16:54:15 +01:00
Mike Fährmann
0ffef58779 merge #7232: [mangapark] fix extractors (#4999 #5883 #6507 #6908) 2025-03-24 20:36:18 +01:00
vonProteus
58e7808bbb [mangapark] utilizing more graphql and adjust functionality for new site (#4999)
- undo formatting changes
- simplify code
- update and fix tests
2025-03-24 20:34:23 +01:00
Mike Fährmann
e1aabf01e4 merge #7220: [deviantart] add subfolder support (#4988 #7185) 2025-03-24 18:43:11 +01:00
Mike Fährmann
7a6899c647 [imhentai] support 'hentaienvy.com' and 'hentaizap.com' (#7192 #7218)
and move 'hentaifox' support to this module as well
2025-03-24 15:33:19 +01:00
Mike Fährmann
fd8f652490 [hitomi] fix extractors (#7230) 2025-03-23 20:32:27 +01:00
Mike Fährmann
5bceb0eb27 [deviantart] add 'subfolder' tests 2025-03-23 17:59:56 +01:00
Mike Fährmann
b20777e68b [bbc] extract more metadata (#6582)
https://github.com/mikf/gallery-dl/issues/6582#issuecomment-2745905375

- title
- title_image
- description
- synopsis
2025-03-23 15:20:12 +01:00
Mike Fährmann
62f9335b82 [kemonoparty] use '"type": "archive"' for 'archives' files (#7195) 2025-03-23 15:07:15 +01:00
Mike Fährmann
4a74bc6e30 [kemonoparty] extract 'archives' metadata (#7195)
add 'archives' option for additional data
2025-03-22 18:38:21 +01:00
Mike Fährmann
4807bc215c [subscribestar] extract 'title' metadata (#7219) 2025-03-22 09:46:08 +01:00
Mike Fährmann
070b75aa82 [patreon] do not match '/messages' URLs as creators (#7187) 2025-03-18 22:07:44 +01:00
Mike Fährmann
a6b8569037 [tests] implement expected failures 2025-03-18 21:48:17 +01:00
Mike Fährmann
567f5d0bc6 [common] add 'subdomains' argument to 'cookies_check()' (#7188) 2025-03-18 14:59:41 +01:00
Mike Fährmann
d746e025a0 [zerochan] parse JSON-LD data (#7178) 2025-03-17 19:59:44 +01:00
Mike Fährmann
6532cf9075 [deviantart] match '/gallery/recommended-for-you' URLs (#7168) 2025-03-17 09:49:11 +01:00
Mike Fährmann
8bdd543935 [deviantart:stash] fix legacy sta.sh links (#7181)
follow redirect instead of rewriting them to deviantart.com/stash/…
2025-03-16 19:38:56 +01:00
Mike Fährmann
bf927cbd4f [config] fix using same key multiple times with 'apply' (#7127) 2025-03-16 19:37:04 +01:00
Mike Fährmann
dbe8820b9e [arcalife] add 'gifs' option (#5657) 2025-03-14 22:34:45 +01:00
Mike Fährmann
5fa5a45f03 [tests] improve error message of multi type/value tests
improvement of 42070240ae
2025-03-14 18:59:57 +01:00
Mike Fährmann
31e57bafab [arcalive] add 'user' extractor (#5657) 2025-03-14 18:58:10 +01:00
hdk5
d900e868e4 [arcalive] add support (#5657 #7100)
* [arca.live] Add extractor skeleton

* [arcalive] update names and formatting

* [arcalive] implement initial file extraction code

* [arcalive] improve '_extract_media()' performance

compile and cache regex on demand

* [arcalive] improve image extraction

- extract 'data-originalurl' URLs if available
- replace URL query strings with 'type=orig'
- ignore emoticons by default

* [arcalive] update defaults

- include 'title' in filenames
- use 0.5-1.5s delay between requests

* [arcalive] use ext from 'data-orig' if available

* [arcalive] update docs/supportedsites

* [arcalive] add tests

* [arcalive] update 'board' extractor pattern

so it doesn't also match 'post' URLs

---------

Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-03-14 10:52:21 +01:00
Mike Fährmann
22d46f2462 [batoto] add 'domain' option (#7174)
allow legacy domains by default
2025-03-14 10:31:49 +01:00
Mike Fährmann
f395a3ec79 [sankaku] fix potential infinite loop (#7155)
https://github.com/mikf/gallery-dl/issues/7155#issuecomment-2723019761
2025-03-14 08:35:54 +01:00
Mike Fährmann
cd1ddb0a67 [wikimedia] add 'subcategories' option (#2340)
https://github.com/mikf/gallery-dl/pull/2340#issuecomment-2707177295
2025-03-12 22:05:44 +01:00
Mike Fährmann
898a09bf7f [sankaku] fix 'tags' metadata (#7155)
rename 'tag_names' to 'tags'
2025-03-12 17:07:40 +01:00
Mike Fährmann
d40f8a82be [tests] add support for skipping an extractor result test 2025-03-12 16:41:46 +01:00
Mike Fährmann
e1bdcd97e1 [furaffinity] extract 'scraps' metadata (#7015)
boolean value indicating whether a post is part of a user's Scraps
folder or the main gallery
2025-03-12 16:29:16 +01:00
Mike Fährmann
a12ff281e7 merge #7159: [furaffinity] add 'folder' extractor (#1817) 2025-03-12 14:08:32 +01:00
Deer-Spangle
859f1e7d04 [furaffinity] Adding a FuraffinityFolderExtractor, which extracts a single folder
- Ensure FuraffinityGalleryExtractor doesn't detect folder links
- Fix example URL for folder extractor
- Reordering classes a bit
- Another tweak of the regex
- One more go at the regex..
- cleanup
2025-03-12 14:00:50 +01:00
Mike Fährmann
94bbbbb16b [sankaku] fix categorized tags for posts with >100 tags (#7155) 2025-03-11 21:01:46 +01:00
Mike Fährmann
1254c4e3d9 [sankaku] update API URLs (#7154 #7155)
and fix errors due to other changes
2025-03-11 18:45:45 +01:00
Mike Fährmann
518865c7de [civitai] fix/improve query parameter handling (#7138) 2025-03-10 20:18:57 +01:00
Mike Fährmann
ce01835995 [facebook] improve 'date' extraction (#7151)
use 'created_time' as alternative when 'publish_time' isn't available
2025-03-10 17:35:32 +01:00
Mike Fährmann
04464b6cf0 [text] add second argument to 'parse_query_list()' (#7138)
return only values whose name is in 'as_list' as a list
2025-03-10 09:36:50 +01:00
Mike Fährmann
d6281b5685 [tenor] relax '/view/' URL pattern (#6075) 2025-03-08 15:54:46 +01:00
Mike Fährmann
52aa5bad4f [tenor] rename 'content_description' to just 'description' 2025-03-08 10:05:49 +01:00
Mike Fährmann
486e307ecd [reddit] add 'selftext' option (#7111) 2025-03-08 09:01:50 +01:00
Mike Fährmann
18b9ffe8c3 [redgifs:search] support '/search?query=...' URLs (#7118) 2025-03-07 12:35:35 +01:00
Mike Fährmann
8582af3483 [tenor] support '/official/' user URLs (#6075) 2025-03-07 11:34:33 +01:00
Mike Fährmann
639ddc95e7 [tenor] support URLs with language codes (#6075) 2025-03-07 11:19:18 +01:00
Mike Fährmann
984116ada7 [furaffinity] improve 'artist_url' extraction (#7115 #7123) 2025-03-06 14:15:11 +01:00
Mike Fährmann
f5073605f6 [tenor] add 'user' extractor (#6075) 2025-03-04 21:47:16 +01:00
Mike Fährmann
198593bf46 [vsco] fix extracting videos from '/gallery' results (#7113) 2025-03-04 16:10:48 +01:00
Mike Fährmann
3a5adbf644 [vsco] fix 'video' extractor (#7113)
fixes regression introduced in 6420210b0f
2025-03-04 09:43:05 +01:00
Mike Fährmann
2f3265a8ae [tenor] add initial support (#6075) 2025-03-03 19:04:50 +01:00
Mike Fährmann
f232a07faf [danbooru:pool] download posts in pool order (#7091)
- add 'order-posts' option
- add 'num' metadata field for pool position
- update default filenames to order by pool position
2025-03-03 16:46:43 +01:00
Mike Fährmann
db19990a82 [text] allow calling 'extract_iter' with invalid arguments 2025-03-02 10:44:06 +01:00
Luca Russo
95c1feab1c [discord] add single message support 2025-02-26 22:16:53 +01:00