Commit Graph

3141 Commits

Author SHA1 Message Date
Mike Fährmann
76b01b64cf [kemonoparty] remove MD5 hash extraction (#3531)
This partially reverts commit 20d6194ffa.
2023-01-25 11:10:09 +01:00
Mike Fährmann
09fb212414 [philomena] match URLs with www subdomain 2023-01-24 22:43:24 +01:00
Mike Fährmann
7e2fd2e573 merge #3560: [deviantart] add support for /deviation/ and fav.me URLs 2023-01-23 14:16:18 +01:00
Mike Fährmann
caae8fefe1 merge #3541: [deviantart] add extractor for status updates 2023-01-23 13:54:51 +01:00
ClosedPort22
c90b4ea8d9 [deviantart] add support for fav.me URLs 2023-01-23 20:35:03 +08:00
Mike Fährmann
d63af4f3d3 merge #3555: [generic] fix regex for non-src image URLs 2023-01-23 13:18:23 +01:00
Mike Fährmann
8993b10751 [mastodon] add 'num' and 'count' metadata fields (#3517) 2023-01-23 13:10:11 +01:00
Mike Fährmann
d817d23ccb [instagram] update csrf token handling
- update internal value according to cookie
- do not send a second 'csrftoken' cookie
2023-01-23 13:01:35 +01:00
Mike Fährmann
00b94946b3 [instagram] show -o cursor=… after every error (#3440) 2023-01-23 13:00:44 +01:00
ClosedPort22
674c719646 [deviantart] refactor base36 conversion 2023-01-23 17:59:55 +08:00
ClosedPort22
293abb8921 [deviantart] add support for /deviation/ URLs 2023-01-23 17:37:57 +08:00
thatfuckingbird
8cfeed78b1 [generic] fix regex for non-src image URLs 2023-01-21 22:32:42 +01:00
Mike Fährmann
fc6ea8ee5c [instagram] update API domain and headers 2023-01-21 17:30:39 +01:00
ClosedPort22
597b89245e [deviantart] misc improvements to status extractor
- relax regex pattern
- handle invalid 'items' field
- add a test for shared sta.sh item

Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2023-01-21 13:04:27 +08:00
Mike Fährmann
137de090dd merge #3549: [twitter] fix search (#3536) 2023-01-20 14:46:30 +01:00
Mike Fährmann
02e314c1b6 merge #3537: [wikifeet/wikifeetx] add 'gallery' extractor 2023-01-20 14:44:25 +01:00
Mike Fährmann
568112dfbb [oauth] improve output
- show which api key / client id gets used (#3518)
- show in which browser authorization URLs gets opened in
2023-01-20 12:33:26 +01:00
ClosedPort22
ab58c375b4 [twitter] fix search (#3536)
- partially revert 18fe4b334d
- properly search for cursor when processing 'replaceEntry'
2023-01-20 14:12:25 +08:00
Mike Fährmann
df91ebb945 [oauth] simplify OAuth 1.0a init 2023-01-19 18:34:54 +01:00
ClosedPort22
013733c9e9 [deviantart] fix index fields for embedded/shared images 2023-01-19 12:17:27 +08:00
ClosedPort22
c4aeca7a5a [deviantart] improve handling of statuses
- recursively yield statuses
- ignore items with missing or unexpected field(s)
2023-01-19 11:48:21 +08:00
ClosedPort22
3b32671fbd [deviantart] add extractor for status updates
extract user status updates using the '/user/statuses/' endpoint
2023-01-19 11:30:48 +08:00
Mike Fährmann
107c60c973 [sankaku] update URL pattern (#3523)
match tag searches with language codes without a trailing slash
2023-01-18 21:38:01 +01:00
enduser420
5cb263fdd2 [wikifeet/wikifeetx] add 'gallery' extractor 2023-01-16 21:08:45 +05:30
Mike Fährmann
35a30498bc merge #3531: [kemonoparty] improve hash extraction
- extract md5 hashes if available
- extract discord file hashes
2023-01-16 15:15:32 +01:00
Mike Fährmann
9683d79bb7 [twitter] "fix" search pagination (#3536, #3534)
- properly process instructions
- do not expect a predetermined instruction order
2023-01-16 14:58:30 +01:00
Mike Fährmann
4fec848858 [twitter] use "browser": "firefox" by default (#3522)
and reenable TLS 1.2 ciphers
2023-01-15 22:11:04 +01:00
Mike Fährmann
78937564fd [twitter] fix login after 32b03433 2023-01-15 22:10:21 +01:00
ClosedPort22
20d6194ffa [kemonoparty] improve hash extraction
- extract MD5 hash from URLs
- extract MD5 and SHA256 hash from Discord URLs (kemono.party only)
- minor optimization (do not call 'hashes.add' when 'duplicates' is
  true)
- update tests accordingly

Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2023-01-15 12:01:13 +08:00
Mike Fährmann
80a2ff2d38 support setting 'write-pages' to "ALL"
to show authentication header, cookies, etc
2023-01-14 22:34:46 +01:00
Mike Fährmann
c881548a27 add 'extractor.retry-codes' option (#3313)
do not retry 429 and 430 by default
2023-01-14 17:25:30 +01:00
Mike Fährmann
e30e8aeef7 [mastodon] rename '_check_move' -> '_check_moved' 2023-01-14 14:46:24 +01:00
Mike Fährmann
32b0343334 [twitter] refresh guest tokens (#3445, #3458) 2023-01-13 22:19:25 +01:00
Mike Fährmann
512abeb4ae [booru] add 'url' option 2023-01-13 16:32:32 +01:00
Mike Fährmann
c87bd1a752 [danbooru] extend 'metadata' option
make it possible to specify a custom list of metadata includes
2023-01-13 16:20:01 +01:00
Mike Fährmann
26c3292538 [twitter] disable TLS 1.2 ciphers by default (#3522) 2023-01-13 16:05:43 +01:00
Mike Fährmann
18fe4b334d [twitter] remove 'tweet_search_mode' from search parameters (#3522)
and update API root and general query parameters
2023-01-13 15:50:46 +01:00
Mike Fährmann
85bd1cbc89 [kemonoparty] fix regression from 473bd380 (#3519)
- do not access 'response.content' unless necessary
- only validate responses if filename extensions differ
2023-01-11 15:25:01 +01:00
Mike Fährmann
473bd380c8 [kemonoparty] reject invalid/empty files (#3510) 2023-01-10 19:04:47 +01:00
Mike Fährmann
4833ec323e [imagefap] add 'folder' extractor (#3504) 2023-01-08 16:57:31 +01:00
Mike Fährmann
362cd6991b [pixiv] implement 'metadata-bookmark' option (#3417) 2023-01-07 23:19:43 +01:00
Mike Fährmann
2142b9c7ae merge #3503: [myhentaigallery] handle whitespace before title tag 2023-01-07 15:15:11 +01:00
Mike Fährmann
3a0450adbf [behance] use default delay between requests (#2507) 2023-01-07 14:49:26 +01:00
Mike Fährmann
2cae4567ba [telegraph] fix file URLs (#3506) 2023-01-07 14:35:06 +01:00
Mike Fährmann
cbaeee9533 [imagefap] warn about redirects to '/human-verification' (#1140) 2023-01-07 13:04:42 +01:00
Mike Fährmann
435de1329a [imagefap] use default delay between requests (#1140) 2023-01-07 12:59:09 +01:00
Erik Rimskog
a8a982359e [myhentaigallery] handle whitespace before the title tag 2023-01-06 16:33:04 +01:00
Mike Fährmann
d1dd52349a merge #3189: [tcbscans] add 'chapter' and 'manga' extractors 2023-01-06 13:42:27 +01:00
Mike Fährmann
2f31d21509 merge #3455: [twitter] apply tweet type checks before uniqueness check 2023-01-06 13:32:50 +01:00
enduser420
e8541a131d [tcbscans] add 'chapter' and 'manga' extractors 2023-01-06 16:16:31 +05:30