Commit Graph

5005 Commits

Author SHA1 Message Date
Mike Fährmann
9c06acb385 [sankaku] compile extended 'tags' pattern only once
per extractor run
2025-05-22 22:30:41 +02:00
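
The "compile only once per extractor run" idea in the commit above can be sketched as lazy compilation cached on the instance. This is an illustration only: the class name, the pattern, and the `compile_count` counter are hypothetical and are not taken from the sankaku extractor.

```python
import re


class TagPatternSketch:
    """Hypothetical stand-in for an extractor; not gallery-dl's actual class."""

    def __init__(self):
        self._tag_pattern = None  # compiled lazily, at most once per instance
        self.compile_count = 0    # only here to make the caching observable

    def _pattern(self):
        # Compile on first use, then reuse the cached pattern object.
        if self._tag_pattern is None:
            self._tag_pattern = re.compile(r"tags?_(\w+)")  # assumed pattern
            self.compile_count += 1
        return self._tag_pattern

    def categories(self, keys):
        """Return the category part of every key that looks like 'tags_<name>'."""
        pattern = self._pattern()
        return [m.group(1) for m in map(pattern.fullmatch, keys) if m]
```

Calling `categories()` for every post then reuses one compiled pattern instead of rebuilding it each time.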
Mike Fährmann
b9f077a1c5 [civitai] make metadata extraction non-fatal (#7562) 2025-05-22 19:18:53 +02:00
Mike Fährmann
f5b8c25559 [pinterest] ignore 'story_pin_product_sticker_block' blocks (#7563) 2025-05-22 18:42:39 +02:00
Mike Fährmann
7b5dd61e17 [sankaku] implement support for new 'tags' categories (#7333 #7553) 2025-05-22 12:41:03 +02:00
prowlguru
b8f3ca6c4e [pixiv] Implement sanity handling for ugoira works (#4327 #6297 #7285 #7434)
* [pixiv] Implement sanity handling for ugoira works

* [pixiv] fix PEP formatting for sanity ugoira handling

* [pixiv] update Ugoira AJAX error message

* [pixiv] respect disabled 'ugoira' option

* [tests] add 'limit_sanity_level' ugoira test

---------

Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-05-21 22:01:09 +02:00
bradenhilton
3ba4404d21 [pixeldrain] add support for filesystem URLs (#7473) 2025-05-21 17:28:09 +02:00
Mike Fährmann
b76e7de1a7 [dl:http] fix setting 'mtime' per file (#7529)
introduce '_http_lastmodified' meta field
2025-05-21 13:50:51 +02:00
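
A minimal sketch of setting a per-file modification time from a metadata field, assuming `_http_lastmodified` holds the raw value of the HTTP `Last-Modified` header. That assumption, and the helper name `apply_mtime`, are illustrative and not confirmed by the commit.

```python
import os
from email.utils import parsedate_to_datetime


def apply_mtime(path, metadata):
    """Set the file's mtime from metadata['_http_lastmodified'], if present.

    Assumes the field contains an HTTP date string such as
    'Wed, 21 May 2025 11:50:51 GMT'. Returns the applied timestamp or None.
    """
    value = metadata.get("_http_lastmodified")
    if not value:
        return None
    timestamp = parsedate_to_datetime(value).timestamp()
    # os.utime takes (atime, mtime); use the same value for both.
    os.utime(path, (timestamp, timestamp))
    return timestamp
```

Keeping the value in each file's own metadata, rather than in shared downloader state, is one way to avoid one file's timestamp leaking into the next.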
Mike Fährmann
b13f464c9a [idolcomplex:pool] add 'Login required' warning 2025-05-21 07:50:22 +02:00
Mike Fährmann
ee3fdb60e9 [idolcomplex] fix 429 error during login by adding a 10s delay 2025-05-21 07:37:11 +02:00
Mike Fährmann
96f7f626d5 [idolcomplex] fix/update pagination logic (#7549) 2025-05-21 07:35:43 +02:00
Mike Fährmann
ac1c61f64a [fanbox] use '"browser": "firefox"' by default (#7490) 2025-05-19 16:47:43 +02:00
Mike Fährmann
0d07dec19c [instagram:stories] fix 'post_date' metadata (#7521) 2025-05-19 16:43:12 +02:00
Mike Fährmann
c28df68f68 [instagram:stories] add 'split' option (#7521) 2025-05-19 16:42:35 +02:00
Mike Fährmann
a6de87d00e [civitai] fix exception for images without modelVersionId (#7432) 2025-05-17 20:17:57 +02:00
Mike Fährmann
b9352bd150 [motherless] fix video gallery downloads (#7530)
Video URLs can no longer be constructed from their thumbnail URLs. They
now need a valid `hash` query parameter or they yield a
'401 Unauthorized' error, thus requiring an extra request per video
to fetch their media page.
2025-05-17 12:12:12 +02:00
Mike Fährmann
311eaf5f11 [subscribestar] fix 'title' extraction for 'trix-attachment' posts (#7526) 2025-05-16 19:09:37 +02:00
Mike Fährmann
0285473b04 [flickr] add 'profile' option 2025-05-13 11:47:34 +02:00
Mike Fährmann
204f1c5f92 [flickr] fix overwriting 'owner'/'user' data when 'info' is enabled 2025-05-13 11:11:03 +02:00
Mike Fährmann
6b84de6cf7 [flickr] add 'info' option (#4720 #6817) 2025-05-12 17:07:36 +02:00
Mike Fährmann
bd7fcdab4c [flickr] provide human-readable 'license_name' metadata 2025-05-12 16:44:22 +02:00
Mike Fährmann
cf6eff7ff7 [flickr] remove constructors 2025-05-12 16:27:14 +02:00
Mike Fährmann
0dce9ad7c6 [mangadex] send Bearer token only when necessary
> Do not send authentication headers unless necessary for your API call
> Authenticated requests cannot be cached, so you're slowing yourself down.

https://api.mangadex.org/docs/02-authentication/
2025-05-12 15:08:13 +02:00
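
The quoted advice can be sketched as building request headers per call and attaching the token only for endpoints that need it. The helper name `build_headers` and the default endpoint list are assumptions for illustration, not the extractor's actual code.

```python
def build_headers(endpoint, token, auth_endpoints=("/user/follows/manga/feed",)):
    """Return request headers, adding a Bearer token only where required.

    'auth_endpoints' is a hypothetical list of path prefixes that need
    authentication; all other requests stay anonymous and cacheable.
    """
    headers = {"Accept": "application/json"}
    if token and endpoint.startswith(auth_endpoints):
        headers["Authorization"] = "Bearer " + token
    return headers
```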
Mike Fährmann
7907d0d3bd [mangadex] add 'following' extractor (#7487)
also fixes the URL pattern for the Updates feed at
https://mangadex.org/titles/feed
2025-05-12 12:58:22 +02:00
Mike Fährmann
f88e42c194 [mangadex] implement login with client credentials
unless the legacy method still works for your account,
login now requires 4 values:

- client-id
- client-secret
- username
- password

where 'client-id' and 'client-secret' are the credentials
of a personal API client.

> Public clients are not yet available.
2025-05-12 11:57:34 +02:00
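
A hedged sketch of how the four values above might be checked and turned into a token request body. The form field names follow the OAuth 2.0 password grant convention and are an assumption here, as is the helper name `build_token_request`. The token endpoint is deliberately left out.

```python
REQUIRED = ("client-id", "client-secret", "username", "password")


def build_token_request(credentials):
    """Validate the four login values and return a form payload.

    Field names ('grant_type', 'client_id', ...) are assumed from the
    OAuth 2.0 password grant, not taken from the commit itself.
    """
    missing = [key for key in REQUIRED if not credentials.get(key)]
    if missing:
        raise ValueError("missing login values: " + ", ".join(missing))
    return {
        "grant_type": "password",
        "client_id": credentials["client-id"],
        "client_secret": credentials["client-secret"],
        "username": credentials["username"],
        "password": credentials["password"],
    }
```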
Mike Fährmann
98fdcd4d72 [subscribestar] fix 'content' extraction (#7486)
and extract 'tags' metadata

Authored by: prowlguru

Co-authored-by: prowlguru <183935626+prowlguru@users.noreply.github.com>
2025-05-10 21:04:27 +02:00
Mike Fährmann
1da44d8fe6 [bluesky] handle exception for posts without 'record' (#7499) 2025-05-10 20:24:25 +02:00
Mike Fährmann
2755425ddd [fanbox] update API headers (#7490) 2025-05-10 20:23:04 +02:00
Mike Fährmann
cd01eacd3d [mastodon] support Akkoma/Pleroma '/objects/:uuid' URLs (#7497) 2025-05-10 18:01:45 +02:00
Mike Fährmann
18eac5ca70 merge #7496: [mastodon] support Akkoma/Pleroma '/notice/:id' URLs 2025-05-10 17:50:34 +02:00
Mike Fährmann
e233360015 [vipergirls] extract more metadata & rename fields (#7479)
- forum_title
- thread_title

- id     -> post_id
- title  -> post_title
- number -> post_num
2025-05-10 17:40:43 +02:00
Mike Fährmann
d872509060 [vipergirls] provide 'num' and 'count' metadata (#7479) 2025-05-10 17:10:32 +02:00
Mike Fährmann
f343c8c0b6 [civitai] fix/improve video downloads (#7502)
- add 'quality-videos' option
- fix downloading static .webp files instead of video
2025-05-10 16:13:48 +02:00
Soblow "Opale" Xaselgio
a94672bede [mastodon] Add support for Akkoma/Pleroma /notice/:status_id urls
Signed-off-by: Soblow "Opale" Xaselgio <113846014+Soblow@users.noreply.github.com>
2025-05-09 12:03:57 +02:00
Mike Fährmann
3799369248 [civitai] provide proper 'extension' for model files (#7432) 2025-05-04 17:34:06 +02:00
Mike Fährmann
b97dc456b0 [twitter] import 'transaction_id' only when needed 2025-05-04 07:42:44 +02:00
Mike Fährmann
6746e38670 [civitai] fix typo 2025-05-03 21:15:26 +02:00
Mike Fährmann
134c3854ee merge #7469: [weasyl] support '/view/' URLs 2025-05-03 20:45:58 +02:00
Mike Fährmann
7a67348fc2 [civitai] implement retrieving 'version' metadata (#7432) 2025-05-03 20:39:35 +02:00
Soblow "Opale" Xaselgio
07ac157acb Accept Weasyl URLs of format /view/12345
Signed-off-by: Soblow "Opale" Xaselgio <113846014+Soblow@users.noreply.github.com>
2025-05-03 19:35:37 +02:00
Mike Fährmann
edc67983ed [twitter] update 'x-csrf-token' header after ct init (#7467) 2025-05-03 12:55:31 +02:00
Mike Fährmann
771317b36c [twitter:ctid] cache client transaction keys (#7382)
and 'ondemand.s.…a.js' responses
2025-05-03 12:50:00 +02:00
Mike Fährmann
a1fd329783 [tumblr] improve error message for dashboard-only blogs (#7455) 2025-05-03 11:02:38 +02:00
Mike Fährmann
e26ae210bd [kemonoparty] restore using default creator posts endpoint
(#7438 #7450 #7462)

- revert c3fb0f53ee
- add '"endpoint": "legacy+"' option
  to use '/posts-legacy' results with full metadata
2025-05-02 21:49:28 +02:00
Mike Fährmann
e0913c95b2 [twitter] generate 'x-client-transaction-id' header values (#7382)
TODO: cache ClientTransaction state on disk
2025-05-02 12:10:05 +02:00
Mike Fährmann
bbb1cc719d [chevereto] fix 'date' metadata (#7437) 2025-04-30 11:05:02 +02:00
Mike Fährmann
95ec8c7414 [manganelo:manga] extract more metadata
- date_updated
- status
- tags
2025-04-30 09:01:33 +02:00
Mike Fährmann
7b2bcf68a5 [manganelo] support 'nelomanga.net' and mirror domains (#7423)
- natomanga.com
- nelomanga.net
- manganato.gg
- mangakakalot.gg
2025-04-29 21:12:37 +02:00
prowlguru
5fa1e13866 [pixiv:novel] fix 'embeds' extraction with AJAX request (#7422 #7435)
* [pixiv] fix novel embeds with AJAX request

All novel embeds seem to be broken since
`meta-preload-data` is no longer present in the
html of the php response after a site change.

* [pixiv] remove headers used for old novel embed request

* [pixiv] cleanup novel embed fix

* [pixiv] handle novel embed exceptions

* [pixiv] fix PEP formatting for novel embeds fix

* Fix error message

* [pixiv] include exception info in novel error message

---------

Co-authored-by: prowlguru <prowlguru@github.com>
Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-04-28 17:14:41 +02:00
Mike Fährmann
16b5d41ce5 [kemonoparty] add 'endpoint' option (#7438) 2025-04-28 16:47:48 +02:00
Mike Fährmann
0725eab34f [deviantart] unescape \' in JSON data (#6653)
https://github.com/mikf/gallery-dl/issues/6653#issuecomment-2833783238
2025-04-28 09:42:16 +02:00
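
For context, `\'` is not a valid escape sequence in JSON, so a strict parser rejects data that contains it. A minimal sketch of the workaround, assuming a plain string replacement before parsing (the helper name `loads_lenient` is hypothetical and the actual fix may differ):

```python
import json


def loads_lenient(text):
    """Parse JSON after turning the invalid escape \\' into a plain quote.

    Caveat: this simple replacement would also alter an escaped backslash
    that happens to be followed by a single quote.
    """
    return json.loads(text.replace("\\'", "'"))
```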