Commit Graph

708 Commits

Author SHA1 Message Date
Mike Fährmann
9f21c839ad [poringa] improvements and fixes
- add 'num' and 'count' metadata fields
- prevent crash for "private" posts
- prevent crash when there's no 'main-info'
- update tests
2023-12-30 20:37:09 +01:00
Antonio
e348da7a06 [poringa] add support 2023-12-27 00:07:23 -06:00
Mike Fährmann
f9544194c0 [paheal] restore 'extension' metadata (#4976) 2023-12-26 16:09:26 +01:00
Mike Fährmann
77d46e6f0c [lynxchan] update 'bbw-chan' domain (#4970) 2023-12-25 15:29:05 +01:00
Mike Fährmann
108c978073 merge #4919: [postmill] add support (#4917) 2023-12-23 15:23:56 +01:00
Mike Fährmann
2a60645095 [deviantart] set 'is_original' for intermediary URLs to 'false' 2023-12-22 14:49:10 +01:00
Mike Fährmann
01bb75f6cb merge #4945: [shimmie2] support 'rule34hentai.net' (#861, #4789) 2023-12-22 00:10:26 +01:00
Mike Fährmann
79e4606893 [rule34hentai] cleanup
- fix using 'self._posts_rule34hentai'
- fix 'file_url' for posts
- update docs/supportedsites
- add tests
2023-12-22 00:01:36 +01:00
Mike Fährmann
627ed794a2 [danbooru] provide 'tags' as list (#4942)
keep the old 'tag_string' values around, similar to sankaku

a lot of repeat code ...
would be a lot less bad if "".split(" ") returned an empty list
2023-12-21 14:39:38 +01:00
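The remark about "".split(" ") in the commit above refers to standard Python behavior: splitting an empty string on an explicit separator returns a list holding one empty string, not an empty list. A minimal sketch of the kind of guard this forces, assuming a space-separated tag string; the helper name `tags_from_string` is hypothetical and not taken from the commit:

```python
def tags_from_string(tag_string):
    """Hypothetical helper: turn a space-separated tag string into a list."""
    # "".split(" ") returns [""], so an empty string needs an explicit guard
    # to produce an empty list of tags.
    return tag_string.split(" ") if tag_string else []


print("".split(" "))                   # one empty string, not an empty list
print("".split())                      # no separator: empty list
print(tags_from_string(""))            # guarded: empty list
print(tags_from_string("1girl solo"))  # two tags
```

Calling split() without arguments would also avoid the empty entry, but it splits on any run of whitespace, which is a different rule from splitting on single spaces.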
Mike Fährmann
99aa923322 [inkbunny] improve '/submissionsviewall.php' patterns (#4934)
allow 'mode=…' to be in any position
don't require it to be somewhere in the middle
2023-12-16 19:21:20 +01:00
Mike Fährmann
3f9c113d78 [mastodon] Support non-numeric status IDs (#4936) 2023-12-16 01:52:31 +01:00
Mike Fährmann
2852404e49 [inkbunny] add 'unread' extractor (#4934) 2023-12-15 21:20:12 +01:00
Mike Fährmann
a37b7759bc [myhentaigallery] recognize '/g/' URLs (#4920) 2023-12-12 20:02:28 +01:00
blankie
fbe14a2745 [postmill] add support 2023-12-12 21:36:52 +11:00
Mike Fährmann
bf74eb5c46 merge #4886: [urlgalleries] add 'gallery' extractor (#919, #1184, #2905) 2023-12-08 22:55:58 +01:00
Mike Fährmann
ade93c5397 [urlgalleries] add tests 2023-12-08 22:55:16 +01:00
Mike Fährmann
4eb3590103 [nijie] fix image URLs of multi-image posts (#4876) 2023-12-05 17:48:50 +01:00
Mike Fährmann
c83fbe6c2d merge #4855: [nitter] fix video extraction (#4853) 2023-11-27 18:39:05 +01:00
Mike Fährmann
1137d72d48 [tests] skip test_init for BaseExtractor classes without instances 2023-11-27 18:36:15 +01:00
Mike Fährmann
625e94fa7d update extractor test results
still not everything, but good enough for now
2023-11-27 18:30:53 +01:00
enduser420
1e9bacd169 [nitter] fix video extraction 2023-11-27 21:58:06 +05:30
Mike Fährmann
95c1dfb089 [tests] swap assertEqual argument order
before this, it would show test failures as
+ test value
- extracted value
when it should be the other way round
2023-11-27 01:06:13 +01:00
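The diff markers mentioned in the commit above follow from how Python's unittest builds its failure message: lines from the first argument of assertEqual are prefixed with "-" and lines from the second with "+". A small sketch, assuming the test value is passed first and the extracted value second; the helper `diff_message` and the sample values are illustrative only:

```python
import unittest


def diff_message(first, second):
    """Return the failure message of assertEqual(first, second), or ''."""
    case = unittest.TestCase()
    try:
        case.assertEqual(first, second)
    except AssertionError as exc:
        return str(exc)
    return ""


# Test value first, extracted value second:
# the test value is shown with "-", the extracted value with "+".
print(diff_message(["test value"], ["extracted value"]))
```

Swapping the two arguments swaps the markers, which is the difference the commit message describes.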
Mike Fährmann
bdb3ce7217 [foolslide] remove 'powermanga.org' 2023-11-26 23:19:05 +01:00
Mike Fährmann
f9dac43be9 [warosu] fix file URLs 2023-11-24 02:44:55 +01:00
Mike Fährmann
645b4627ef [sankaku] update URL patterns 2023-11-24 02:41:52 +01:00
Mike Fährmann
119755a5a3 [tests] implement skipping/failing tests when pressing ctrl+c 2023-11-24 00:48:37 +01:00
Mike Fährmann
1ae43d8123 merge #4841: [fapello] support '.su' TLD (#4840) 2023-11-22 20:18:32 +01:00
Mike Fährmann
e1404827a6 [pixeldrain] add 'file' and 'album' extractors (#4839) 2023-11-22 19:01:19 +01:00
enduser420
2402162e8a [fapello] support '.su' TLD 2023-11-22 19:35:43 +05:30
Mike Fährmann
725c8dd55a [tmohentai] 'categories' -> 'genres'
quite likely that the site meant 'genres' by "Genders"
2023-11-21 22:11:43 +01:00
Mike Fährmann
ce7c4cb544 merge #4832: [tmohentai] add 'gallery' extractor (#4808) 2023-11-21 20:25:49 +01:00
Mike Fährmann
c4a201ed42 [tmohentai] simplify + tests 2023-11-21 20:24:07 +01:00
Mike Fährmann
e17a48fe56 [blogger] inherit from BaseExtractor
- support www.micmicidol.club (#4759)
2023-11-21 16:52:25 +01:00
Mike Fährmann
0fa85360a0 merge #4812: [erome] add 'count' metadata field 2023-11-20 22:42:02 +01:00
Mike Fährmann
a43cf78bb7 [erome] tests 2023-11-20 22:41:12 +01:00
Mike Fährmann
07cb584231 [behance] add 'modules' option (#4799) 2023-11-17 22:54:38 +01:00
Mike Fährmann
ea78f67860 [downloader:http] skip files not passing filesize-min/-max (#4821)
instead of failing the download
2023-11-17 22:54:20 +01:00
Mike Fährmann
3f591d5a4e [mastodon] update test results 2023-11-11 21:24:07 +01:00
Mike Fährmann
6402f2950f [pp:metadata] ignore non-string tag values (#4764) 2023-11-04 17:33:14 +01:00
Mike Fährmann
007c433677 [patreon] support 'id:<campaign_id>' in place of a user name
https://patreon.com/id:12345
… and remove 'campaign-id' config option
2023-11-04 00:17:41 +01:00
Mike Fährmann
43a3d93467 merge #4755: [twitter] recognize fixupx.com URLs 2023-11-02 15:33:29 +01:00
Mike Fährmann
cdf77e326f [twitter] add test for fixupx.com 2023-11-02 15:32:48 +01:00
Mike Fährmann
fc8f86bf24 [hitomi] recognize 'imageset' gallery URLs (#4756) 2023-11-02 15:29:44 +01:00
Mike Fährmann
72b18d701f represent util.NONE as 'null' in JSON output
was '"None"' before
2023-11-02 15:23:28 +01:00
Mike Fährmann
68e72a836c [exhentai] fix extraction (#4730)
- update to new API response layout
- use proper API server URL
- fix 'filesize' metadata
2023-10-30 13:38:49 +01:00
Mike Fährmann
fd8f58ad76 [behance] unescape embed URLs (#4742) 2023-10-30 13:38:49 +01:00
Mike Fährmann
c9a2be36d4 [sankaku] support '/posts/' tag search URLs (#4740) 2023-10-29 13:48:42 +01:00
Mike Fährmann
218295a4c6 [twitter] fix avatars without 'date' information (#4696) 2023-10-27 17:58:02 +02:00
Mike Fährmann
d0effcae20 [kemonoparty] add 'revision_index' metadata field (#4727) 2023-10-26 22:26:38 +02:00
Mike Fährmann
3bbaa875f1 [kemonoparty] fix parsing of non-standard 'dates' (#4676) 2023-10-26 21:50:18 +02:00