Commit Graph

4919 Commits

Author SHA1 Message Date
Mike Fährmann
6e87ad2ccc [fanbox] extract 'archives' metadata (#7454) 2025-05-27 18:04:35 +02:00
Mike Fährmann
47af8459c2 [util] add collections of image/video/archive filename extensions 2025-05-26 23:08:27 +02:00
Mike Fährmann
45cd81639e [fanbox] return 'fileMap' files in order (#2718) 2025-05-26 22:05:51 +02:00
Mike Fährmann
204c94ccb4 [deviantart:tag] fix 'username' (#7587) 2025-05-26 21:34:06 +02:00
Mike Fährmann
5ebea85bd1 [patreon] send 'Referer' when downloading m3u8 videos (#7571) 2025-05-26 17:35:49 +02:00
Mike Fährmann
f1ab8c39fb [kemonoparty] add 'artists' extractor (#7582) 2025-05-26 11:35:55 +02:00
Mike Fährmann
e469dc0da8 [imagevenue] detect 404 image files (#7570) 2025-05-24 22:12:48 +02:00
Mike Fährmann
5e07d30d91 [arcalive] replace 'ac-p.namu' subdomains with 'ac-o.namu' (#7556)
to prevent "403 Forbidden" errors when downloading
2025-05-24 22:12:15 +02:00
Mike Fährmann
1f869c2786 [kemonoparty:discord] update server & channel metadata (#7569)
to match the Discord Extractors' names

- rename
    server       -> server_id
    channel      -> channel_id
    channel_name -> channel
- add
    server
    channel_nsfw
    channel_type
    channel_topic
    parent_id
2025-05-24 18:05:03 +02:00
Mike Fährmann
ed9c960bb9 [kemonoparty] remove '_prepare_ddosguard_cookies()' 2025-05-24 18:05:03 +02:00
Mike Fährmann
e199396872 [common] simplify 'user' extractors by using 'Dispatch' mixin 2025-05-24 18:04:53 +02:00
Mike Fährmann
c3e8af945d [sankaku] fix passing cookies (#7333)
to allow '"tags": "extended"' to work properly
2025-05-23 19:21:56 +02:00
Mike Fährmann
b81fc5c124 replace text.rextract() with rextr() 2025-05-23 18:28:58 +02:00
Mike Fährmann
83054325e0 [arcalive] allow overriding default User-Agent header (#7556) 2025-05-23 08:05:47 +02:00
Mike Fährmann
9c06acb385 [sankaku] compile extended 'tags' pattern only once
per extractor run
2025-05-22 22:30:41 +02:00
Mike Fährmann
b9f077a1c5 [civitai] make metadata extraction non-fatal (#7562) 2025-05-22 19:18:53 +02:00
Mike Fährmann
f5b8c25559 [pinterest] ignore 'story_pin_product_sticker_block' blocks (#7563) 2025-05-22 18:42:39 +02:00
Mike Fährmann
7b5dd61e17 [sankaku] implement support for new 'tags' categories (#7333 #7553) 2025-05-22 12:41:03 +02:00
prowlguru
b8f3ca6c4e [pixiv] Implement sanity handling for ugoira works (#4327 #6297 #7285 #7434)
* [pixiv] Implement sanity handling for ugoira works

* [pixiv] fix PEP formatting for sanity ugoira handling

* [pixiv] update Ugoira AJAX error message

* [pixiv] respect disabled 'ugoira' option

* [tests] add 'limit_sanity_level' ugoira test

---------

Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-05-21 22:01:09 +02:00
bradenhilton
3ba4404d21 [pixeldrain] add support for filesystem URLs (#7473) 2025-05-21 17:28:09 +02:00
Mike Fährmann
b76e7de1a7 [dl:http] fix setting 'mtime' per file (#7529)
introduce '_http_lastmodified' meta field
2025-05-21 13:50:51 +02:00
Mike Fährmann
b13f464c9a [idolcomplex:pool] add 'Login required' warning 2025-05-21 07:50:22 +02:00
Mike Fährmann
ee3fdb60e9 [idolcomplex] fix 429 error during login by adding a 10s delay 2025-05-21 07:37:11 +02:00
Mike Fährmann
96f7f626d5 [idolcomplex] fix/update pagination logic (#7549) 2025-05-21 07:35:43 +02:00
Mike Fährmann
ac1c61f64a [fanbox] use '"browser": "firefox"' by default (#7490) 2025-05-19 16:47:43 +02:00
Mike Fährmann
0d07dec19c [instagram:stories] fix 'post_date' metadata (#7521) 2025-05-19 16:43:12 +02:00
Mike Fährmann
c28df68f68 [instagram:stories] add 'split' option (#7521) 2025-05-19 16:42:35 +02:00
Mike Fährmann
a6de87d00e [civitai] fix exception for images without modelVersionId (#7432) 2025-05-17 20:17:57 +02:00
Mike Fährmann
b9352bd150 [motherless] fix video gallery downloads (#7530)
Video URLs can no longer be constructed from their thumbnail URLs. They
now need a valid `hash` query parameter or they yield a
'401 Unauthorized' error, thus requiring an extra request per video
to fetch their media page.
2025-05-17 12:12:12 +02:00
Mike Fährmann
311eaf5f11 [subscribestar] fix 'title' extraction for 'trix-attachment' posts (#7526) 2025-05-16 19:09:37 +02:00
Mike Fährmann
0285473b04 [flickr] add 'profile' option 2025-05-13 11:47:34 +02:00
Mike Fährmann
204f1c5f92 [flickr] fix overwriting 'owner'/'user' data when 'info' is enabled 2025-05-13 11:11:03 +02:00
Mike Fährmann
6b84de6cf7 [flickr] add 'info' option (#4720 #6817) 2025-05-12 17:07:36 +02:00
Mike Fährmann
bd7fcdab4c [flickr] provide human-readable 'license_name' metadata 2025-05-12 16:44:22 +02:00
Mike Fährmann
cf6eff7ff7 [flickr] remove constructors 2025-05-12 16:27:14 +02:00
Mike Fährmann
0dce9ad7c6 [mangadex] send Bearer token only when necessary
> Do not send authentication headers unless necessary for your API call
> Authenticated requests cannot be cached, so you're slowing yourself down.

https://api.mangadex.org/docs/02-authentication/
2025-05-12 15:08:13 +02:00
Mike Fährmann
7907d0d3bd [mangadex] add 'following' extractor (#7487)
also fixes the URL pattern for the Updates feed at
https://mangadex.org/titles/feed
2025-05-12 12:58:22 +02:00
Mike Fährmann
f88e42c194 [mangadex] implement login with client credentials
unless the legacy method still works for your account,
login now requires 4 values:

- client-id
- client-secret
- username
- password

where 'client-id' and 'client-secret' are the credentials
of a personal API client.

> Public clients are not yet available.
2025-05-12 11:57:34 +02:00
Mike Fährmann
98fdcd4d72 [subscribestar] fix 'content' extraction (#7486)
and extract 'tags' metadata

Authored by: prowlguru

Co-authored-by: prowlguru <183935626+prowlguru@users.noreply.github.com>
2025-05-10 21:04:27 +02:00
Mike Fährmann
1da44d8fe6 [bluesky] handle exception for posts without 'record' (#7499) 2025-05-10 20:24:25 +02:00
Mike Fährmann
2755425ddd [fanbox] update API headers (#7490) 2025-05-10 20:23:04 +02:00
Mike Fährmann
cd01eacd3d [mastodon] support Akkoma/Pleroma '/objects/:uuid' URLs (#7497) 2025-05-10 18:01:45 +02:00
Mike Fährmann
18eac5ca70 merge #7496: [mastodon] support Akkoma/Pleroma '/notice/:id' URLs 2025-05-10 17:50:34 +02:00
Mike Fährmann
e233360015 [vipergirls] extract more metadata & rename fields (#7479)
- forum_title
- thread_title

- id     -> post_id
- title  -> post_title
- number -> post_num
2025-05-10 17:40:43 +02:00
Mike Fährmann
d872509060 [vipergirls] provide 'num' and 'count' metadata (#7479) 2025-05-10 17:10:32 +02:00
Mike Fährmann
f343c8c0b6 [civitai] fix/improve video downloads (#7502)
- add 'quality-videos' option
- fix downloading static .webp files instead of video
2025-05-10 16:13:48 +02:00
Soblow "Opale" Xaselgio
a94672bede [mastodon] Add support for Akkoma/Pleroma /notice/:status_id urls
Signed-off-by: Soblow "Opale" Xaselgio <113846014+Soblow@users.noreply.github.com>
2025-05-09 12:03:57 +02:00
Mike Fährmann
3799369248 [civitai] provide proper 'extension' for model files (#7432) 2025-05-04 17:34:06 +02:00
Mike Fährmann
b97dc456b0 [twitter] import 'transaction_id' only when needed 2025-05-04 07:42:44 +02:00
Mike Fährmann
6746e38670 [civitai] fix typo 2025-05-03 21:15:26 +02:00