Commit Graph

1291 Commits

Author SHA1 Message Date
Mike Fährmann
4143e8e8b3 [everia] improve/fix image extraction (#7270) 2025-03-30 18:42:05 +02:00
Mike Fährmann
4bcbc2d5b4 [E621:pool] fix AttributeError (#7265)
fixes regression introduced in 24bbcbcfa3
2025-03-30 18:41:58 +02:00
Mike Fährmann
c4d08b24e9 [pinterest] ignore 'story_pin_static_sticker_block' blocks (#7251) 2025-03-28 20:20:29 +01:00
Mike Fährmann
e0211bdaa5 merge #7261: [deviantart] support multiple images for single posts
(#6653)
2025-03-28 17:26:48 +01:00
Mike Fährmann
677c187800 [deviantart] add multi-file test 2025-03-28 17:19:31 +01:00
Mike Fährmann
8bfb5054c1 merge #7191: [tiktok] add retry mechanism to rehydration data extraction 2025-03-28 16:42:36 +01:00
Mike Fährmann
e381c482ec [sexcom] extract 'date_url' metadata (#7239) 2025-03-28 16:30:12 +01:00
Mike Fährmann
d4ce8be1f5 [sexcom] support new-style '/videos' URLs (#7239) 2025-03-27 21:56:06 +01:00
Mike Fährmann
c6bc46f5ba [sexcom] support new-style '/gifs' URLs (#7239) 2025-03-27 20:01:57 +01:00
Mike Fährmann
12327b076e [sexcom] fix 'gif' pin extraction (#7239)
with much less metadata
2025-03-27 19:54:17 +01:00
Mike Fährmann
7ccf64596e [mastodon] support '/statuses' URLs (#7255)
- /statuses/123456789
- /users/USER/statuses/123456789
2025-03-27 18:32:04 +01:00
Mike Fährmann
6894e0bc70 [arcalive] extend 'gifs' option
use fallback URLs by default
2025-03-26 20:59:18 +01:00
Mike Fährmann
24bbcbcfa3 [danbooru] add 'favgroup' extractor 2025-03-26 20:58:49 +01:00
Mike Fährmann
a3a61317d6 [kemonoparty:discord] look up 'channel_name' when missing (#7245) 2025-03-26 11:33:32 +01:00
Mike Fährmann
32a06961ba [mangapark] support v3 URLs (#2072) 2025-03-25 20:01:45 +01:00
Mike Fährmann
ebf05e53fe [mangapark] support mirror domains 2025-03-25 19:37:26 +01:00
Mike Fährmann
03c87b71ca [nozomi] fix extractors (#7242)
update internal domains to 'gold-usergeneratedcontent.net'
2025-03-25 16:54:15 +01:00
Mike Fährmann
0ffef58779 merge #7232: [mangapark] fix extractors (#4999 #5883 #6507 #6908) 2025-03-24 20:36:18 +01:00
vonProteus
58e7808bbb [mangapark] utilizing more graphql and adjust functionality for new site (#4999)
- undo formatting changes
- simplify code
- update and fix tests
2025-03-24 20:34:23 +01:00
Mike Fährmann
e1aabf01e4 merge #7220: [deviantart] add subfolder support (#4988 #7185) 2025-03-24 18:43:11 +01:00
Mike Fährmann
7a6899c647 [imhentai] support 'hentaienvy.com' and 'hentaizap.com' (#7192 #7218)
and move 'hentaifox' support to this module as well
2025-03-24 15:33:19 +01:00
Mike Fährmann
fd8f652490 [hitomi] fix extractors (#7230) 2025-03-23 20:32:27 +01:00
Mike Fährmann
5bceb0eb27 [deviantart] add 'subfolder' tests 2025-03-23 17:59:56 +01:00
Mike Fährmann
b20777e68b [bbc] extract more metadata (#6582)
https://github.com/mikf/gallery-dl/issues/6582#issuecomment-2745905375

- title
- title_image
- description
- synopsis
2025-03-23 15:20:12 +01:00
Mike Fährmann
62f9335b82 [kemonoparty] use '"type": "archive"' for 'archives' files (#7195) 2025-03-23 15:07:15 +01:00
Mike Fährmann
4a74bc6e30 [kemonoparty] extract 'archives' metadata (#7195)
add 'archives' option for additional data
2025-03-22 18:38:21 +01:00
Mike Fährmann
4807bc215c [subscribestar] extract 'title' metadata (#7219) 2025-03-22 09:46:08 +01:00
CasualYT31
d6d2b1fba0 [tiktok] Queue links from user profiles instead of returning a list of links
[tiktok] Move avatar download to the user extractor, which results in more accurate metadata output (it would previously write the metadata of the video which the avatar was scraped from)
[tiktok] Fix tests and remove redundant user profile test
2025-03-19 18:50:06 +00:00
Mike Fährmann
070b75aa82 [patreon] do not match '/messages' URLs as creators (#7187) 2025-03-18 22:07:44 +01:00
Mike Fährmann
a6b8569037 [tests] implement expected failures 2025-03-18 21:48:17 +01:00
Mike Fährmann
567f5d0bc6 [common] add 'subdomains' argument to 'cookies_check()' (#7188) 2025-03-18 14:59:41 +01:00
Mike Fährmann
d746e025a0 [zerochan] parse JSON-LD data (#7178) 2025-03-17 19:59:44 +01:00
Mike Fährmann
6532cf9075 [deviantart] match '/gallery/recommended-for-you' URLs (#7168) 2025-03-17 09:49:11 +01:00
Mike Fährmann
8bdd543935 [deviantart:stash] fix legacy sta.sh links (#7181)
follow redirect instead of rewriting them to deviantart.com/stash/…
2025-03-16 19:38:56 +01:00
Mike Fährmann
bf927cbd4f [config] fix using same key multiple times with 'apply' (#7127) 2025-03-16 19:37:04 +01:00
Mike Fährmann
dbe8820b9e [arcalife] add 'gifs' option (#5657) 2025-03-14 22:34:45 +01:00
Mike Fährmann
5fa5a45f03 [tests] improve error message of multi type/value tests
improvement of 42070240ae
2025-03-14 18:59:57 +01:00
Mike Fährmann
31e57bafab [arcalive] add 'user' extractor (#5657) 2025-03-14 18:58:10 +01:00
hdk5
d900e868e4 [arcalive] add support (#5657 #7100)
* [arca.live] Add extractor skeleton

* [arcalive] update names and formatting

* [arcalive] implement initial file extraction code

* [arcalive] improve '_extract_media()' performance

compile and cache regex on demand

* [arcalive] improve image extraction

- extract 'data-originalurl' URLs if available
- replace URL query strings with 'type=orig'
- ignore emoticons by default

* [arcalive] update defaults

- include 'title' in filenames
- use 0.5-1.5s delay between requests

* [arcalive] use ext from 'data-orig' if available

* [arcalive] update docs/supportedsites

* [arcalive] add tests

* [arcalive] update 'board' extractor pattern

so it doesn't also match 'post' URLs

---------

Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-03-14 10:52:21 +01:00
Mike Fährmann
22d46f2462 [batoto] add 'domain' option (#7174)
allow legacy domains by default
2025-03-14 10:31:49 +01:00
Mike Fährmann
f395a3ec79 [sankaku] fix potential infinite loop (#7155)
https://github.com/mikf/gallery-dl/issues/7155#issuecomment-2723019761
2025-03-14 08:35:54 +01:00
Mike Fährmann
cd1ddb0a67 [wikimedia] add 'subcategories' option (#2340)
https://github.com/mikf/gallery-dl/pull/2340#issuecomment-2707177295
2025-03-12 22:05:44 +01:00
Mike Fährmann
898a09bf7f [sankaku] fix 'tags' metadata (#7155)
rename 'tag_names' to 'tags'
2025-03-12 17:07:40 +01:00
Mike Fährmann
d40f8a82be [tests] add support for skipping an extractor result test 2025-03-12 16:41:46 +01:00
Mike Fährmann
e1bdcd97e1 [furaffinity] extract 'scraps' metadata (#7015)
boolean value indicating whether a post is part of a user's Scraps
folder or the main gallery
2025-03-12 16:29:16 +01:00
Mike Fährmann
a12ff281e7 merge #7159: [furaffinity] add 'folder' extractor (#1817) 2025-03-12 14:08:32 +01:00
Deer-Spangle
859f1e7d04 [furaffinity] Adding a FuraffinityFolderExtractor, which extracts a single folder
- Ensure FuraffinityGalleryExtractor doesn't detect folder links
- Fix example URL for folder extractor
- Reordering classes a bit
- Another tweak of the regex
- One more go at the regex..
- cleanup
2025-03-12 14:00:50 +01:00
Mike Fährmann
94bbbbb16b [sankaku] fix categorized tags for posts with >100 tags (#7155) 2025-03-11 21:01:46 +01:00
Mike Fährmann
1254c4e3d9 [sankaku] update API URLs (#7154 #7155)
and fix errors due to other changes
2025-03-11 18:45:45 +01:00
Mike Fährmann
518865c7de [civitai] fix/improve query parameter handling (#7138) 2025-03-10 20:18:57 +01:00