Commit Graph

1391 Commits

Author SHA1 Message Date
Mike Fährmann
827eeca0bc [paheal] fix '404 Not Found' for tags with URL encoded characters (#7642) 2025-06-08 16:23:11 +02:00
Mike Fährmann
b95b71f63f [tests:results] implement general tests for list items 2025-06-07 10:36:43 +02:00
Mike Fährmann
fc01d85823 [tests:results] split '_test_kwdict()' 2025-06-07 10:16:47 +02:00
Mike Fährmann
6e120f2551 [danbooru] fix Ugoira for instances without 'Ugoira:FrameMimeType' (#7630)

fixes regression introduced in 1866f8b97b
2025-06-07 07:47:03 +02:00
Mike Fährmann
3e423937d2 [misskey] implement 'include' option (#5347) 2025-06-06 20:52:03 +02:00
Mike Fährmann
ac09cac978 [misskey] add 'avatar' and 'background' extractors (#5347) 2025-06-06 20:14:05 +02:00
Mike Fährmann
9c4cef822e [komikcast] update domain to 'komikcast02.com' 2025-06-06 20:14:02 +02:00
Mike Fährmann
b5c88b3d3e replace standard library 're' uses with 'util.re()' 2025-06-06 13:24:52 +02:00
Mike Fährmann
e1f03a5a93 [util] allow 'noop()' to accept one optional argument
fixes 'NullResponse.close()'
2025-06-05 20:08:24 +02:00
Mike Fährmann
685836f6fd [dynastyscans] add 'anthology' extractor (#7627) 2025-06-04 21:23:49 +02:00
Mike Fährmann
b5334f5837 [everia] prevent redirect when fetching a post page 2025-06-04 11:09:40 +02:00
missionfloyd
72e1a4a0cb [everia] unquote URLs (#7620)
* [everia.club] unescape URLs

* add test
2025-06-04 09:38:06 +02:00
Mike Fährmann
3c6c40d4ed [nijie] fix file extraction (#7624)
ignore empty URLs / URLs with no 'src="'
2025-06-04 07:57:27 +02:00
Mike Fährmann
4fc719bb10 [util] add 're' helper 2025-06-04 07:56:43 +02:00
Mike Fährmann
df4845bf60 [sexcom] update
- fix 'title' and 'type' of pictures
- remove '#' from the beginning of each tag
- add 'gifs' option
2025-06-04 07:44:43 +02:00
thatDudo
0b0152b347 [rawkuma] add support (#4571)
* Add rawkuma extractor

* Fix flake8 warnings

* Remove f-string

* Fix regex call

* update domain to rawkuma.net

* fix 'manga' extractor

* fix 'chapter' extractor

* add tests

* update docs/supportedsites

---------

Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-06-02 19:15:53 +02:00
wankio
47c7c85f46 [sexcom] support '/pics/' URLs (#7611)
* Update sexcom.py

For '/pics/' URLs, e.g.
https://www.sex.com/en/pics/1459016

* reorder 'pattern'

* strip '?width=…' query parameter

* add test

---------

Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-06-01 11:13:17 +02:00
Mike Fährmann
9ed31cf992 [civitai] fix & update 'search' extractor (#7609)
- rename to 'search-models'
- use SearchAPI
2025-06-01 10:30:09 +02:00
Mike Fährmann
dc467c7790 [civitai] add 'search-images' extractor (#7609) 2025-06-01 10:06:18 +02:00
Mike Fährmann
02b34e6680 [civitai] support "My Reactions" results for videos (#7608) 2025-05-31 21:30:39 +02:00
Mike Fährmann
9c3e458ced [tests:extractor] check initial '_cfgpath' 2025-05-31 11:00:11 +02:00
Mike Fährmann
8adf48316f [ytdl] detect yt-dlp independent of module name (#7599) 2025-05-30 23:52:32 +02:00
Mike Fährmann
ec523c2c2c [mangasee] remove module 2025-05-30 18:04:55 +02:00
Mike Fährmann
922c296482 [kemono][coomer][schalenetwork] rename modules & extractors
category changes:

- kemonoparty -> kemono
- coomerparty -> coomer
- koharu      -> schalenetwork

also wanted to rename '2chen' -> 'sturdychan',
but the site's main page is still titled '2chen'
2025-05-30 17:51:49 +02:00
Mike Fährmann
002c25d417 [util] add 'NullResponse' class 2025-05-29 20:27:40 +02:00
Mike Fährmann
539ed8fef4 [formatter] add 'D' conversion 2025-05-29 20:09:26 +02:00
Mike Fährmann
17b2910938 [util] implement 'to_datetime()' 2025-05-28 20:10:18 +02:00
Mike Fährmann
6e87ad2ccc [fanbox] extract 'archives' metadata (#7454) 2025-05-27 18:04:35 +02:00
Mike Fährmann
456e93d489 [tests:results] enable 'extension' and ':?' in format tests 2025-05-26 13:50:42 +02:00
Mike Fährmann
e4d139c3ce [tests:text] add cases for query params with empty values 2025-05-26 13:47:46 +02:00
Mike Fährmann
f1ab8c39fb [kemonoparty] add 'artists' extractor (#7582) 2025-05-26 11:35:55 +02:00
Mike Fährmann
e469dc0da8 [imagevenue] detect 404 image files (#7570) 2025-05-24 22:12:48 +02:00
Mike Fährmann
1f869c2786 [kemonoparty:discord] update server & channel metadata (#7569)
to match the Discord Extractors' names

- rename
    server       -> server_id
    channel      -> channel_id
    channel_name -> channel
- add
    server
    channel_nsfw
    channel_type
    channel_topic
    parent_id
2025-05-24 18:05:03 +02:00
Mike Fährmann
f3ed15573a [text] add 'rextr()' 2025-05-23 17:28:58 +02:00
Mike Fährmann
f5b8c25559 [pinterest] ignore 'story_pin_product_sticker_block' blocks (#7563) 2025-05-22 18:42:39 +02:00
Mike Fährmann
7b5dd61e17 [sankaku] implement support for new 'tags' categories (#7333 #7553) 2025-05-22 12:41:03 +02:00
prowlguru
b8f3ca6c4e [pixiv] Implement sanity handling for ugoira works (#4327 #6297 #7285 #7434)
* [pixiv] Implement sanity handling for ugoira works

* [pixiv] fix PEP formatting for sanity ugoira handling

* [pixiv] update Ugoira AJAX error message

* [pixiv] respect disabled 'ugoira' option

* [tests] add 'limit_sanity_level' ugoira test

---------

Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-05-21 22:01:09 +02:00
bradenhilton
3ba4404d21 [pixeldrain] add support for filesystem URLs (#7473) 2025-05-21 17:28:09 +02:00
Mike Fährmann
a6de87d00e [civitai] fix exception for images without modelVersionId (#7432) 2025-05-17 20:17:57 +02:00
Mike Fährmann
0285473b04 [flickr] add 'profile' option 2025-05-13 11:47:34 +02:00
Mike Fährmann
6b84de6cf7 [flickr] add 'info' option (#4720 #6817) 2025-05-12 17:07:36 +02:00
Mike Fährmann
bd7fcdab4c [flickr] provide human-readable 'license_name' metadata 2025-05-12 16:44:22 +02:00
Mike Fährmann
7907d0d3bd [mangadex] add 'following' extractor (#7487)
also fixes the URL pattern for the Updates feed at
https://mangadex.org/titles/feed
2025-05-12 12:58:22 +02:00
Mike Fährmann
98fdcd4d72 [subscribestar] fix 'content' extraction (#7486)
and extract 'tags' metadata

Authored by: prowlguru

Co-authored-by: prowlguru <183935626+prowlguru@users.noreply.github.com>
2025-05-10 21:04:27 +02:00
Mike Fährmann
cd01eacd3d [mastodon] support Akkoma/Pleroma '/objects/:uuid' URLs (#7497) 2025-05-10 18:01:45 +02:00
Mike Fährmann
18eac5ca70 merge #7496: [mastodon] support Akkoma/Pleroma '/notice/:id' URLs 2025-05-10 17:50:34 +02:00
Mike Fährmann
1f7c29aee9 [tests:mastodon] move comments into '#comments' fields 2025-05-10 17:49:52 +02:00
Mike Fährmann
e233360015 [vipergirls] extract more metadata & rename fields (#7479)
- forum_title
- thread_title

- id     -> post_id
- title  -> post_title
- number -> post_num
2025-05-10 17:40:43 +02:00
Mike Fährmann
d872509060 [vipergirls] provide 'num' and 'count' metadata (#7479) 2025-05-10 17:10:32 +02:00
Mike Fährmann
f343c8c0b6 [civitai] fix/improve video downloads (#7502)
- add 'quality-videos' option
- fix downloading static .webp files instead of video
2025-05-10 16:13:48 +02:00