27 Commits

Author SHA1 Message Date
Mike Fährmann
968597a302 yield 3-tuples for Message.Directory
adapt tuples to the same length and semantics as other messages
2025-12-05 21:39:52 +01:00
Mike Fährmann
f39c8b125f [4chan] detect files containing only null bytes (#7883) 2025-07-24 09:11:52 +02:00
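The message above does not say how the check is done. As a minimal sketch only, assuming the file content is already available as a `bytes` object, one way to detect an all-null payload looks like this (`is_all_null` is a hypothetical helper name, not taken from the repository):

```python
def is_all_null(data: bytes) -> bool:
    """Return True if data is non-empty and consists only of null bytes.

    Hypothetical helper for illustration; the actual extractor code may
    perform this check differently.
    """
    # Stripping null bytes from both ends leaves nothing behind
    # exactly when every byte in the payload is b"\0".
    return bool(data) and not data.strip(b"\0")
```

An empty payload is deliberately reported as `False` here, since "no data" and "only null bytes" are different failure modes.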
Mike Fährmann
f2a72d8d1e replace 'request(…).json()' with 'request_json(…)' 2025-06-29 17:50:19 +02:00
Mike Fährmann
9dbe33b6de replace old %-formatted and .format(…) strings with f-strings (#7671)
mostly using flynt
https://github.com/ikamensh/flynt
2025-06-29 17:50:19 +02:00
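For readers unfamiliar with the conversion, the three formatting styles named in this commit produce the same string. The variable names and values below are made up for illustration:

```python
# Hypothetical values, not taken from the extractor.
board, thread = "g", 12345

old_percent = "%s/%d" % (board, thread)      # %-formatting
old_format = "{}/{}".format(board, thread)   # str.format()
new_fstring = f"{board}/{thread}"            # f-string
```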
Mike Fährmann
41191bb60a 'match.group(N)' -> 'match[N]' (#7671)
2.5x faster
2025-06-18 13:05:58 +02:00
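The change relies on `re.Match` objects supporting subscript access (available since Python 3.6), which returns the same value as `group()`. A small sketch, with a pattern and URL invented for illustration:

```python
import re

# Hypothetical pattern and URL; not the extractor's real pattern.
pattern = re.compile(r"/(\w+)/thread/(\d+)")
match = pattern.search("https://boards.example.org/g/thread/12345")

# Old style: explicit method call.
board_old, thread_old = match.group(1), match.group(2)

# New style: subscript access, equivalent to group(N).
board_new, thread_new = match[1], match[2]
```

The speed-up figure is the one reported in the commit message; it is not measured here.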
Mike Fährmann
a453335a9f remove test results in extractor modules
and add generic example URLs
2023-09-11 16:30:55 +02:00
Mike Fährmann
bd08ee2859 remove most 'yield Message.Version' statements
only leave them in oauth.py as noop results
2021-08-16 03:10:48 +02:00
Mike Fährmann
968d3e8465 remove '&' from URL patterns
'/?&#' -> '/?#' and '?&#' -> '?#'

According to https://www.ietf.org/rfc/rfc3986.txt, URLs are
"organized hierarchically" by using "the slash ("/"), question
mark ("?"), and number sign ("#") characters to delimit components"
2020-10-22 23:31:25 +02:00
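As a rough illustration of the rationale, a character class of `/`, `?`, and `#` is enough to recognize the end of a URL path component, because `&` only separates parameters inside a query string. The pattern, domain, and function name below are assumptions made up for this sketch:

```python
import re

# Hypothetical pattern: a board name followed by a component
# delimiter ('/', '?', '#') or the end of the URL.
BOARD = re.compile(r"https?://example\.org/(\w+)(?:[/?#]|$)")

def board_of(url):
    """Return the board name, or None if the URL does not match."""
    m = BOARD.match(url)
    return m[1] if m else None
```

With this pattern, `https://example.org/g/`, `https://example.org/g?page=2`, and `https://example.org/g#top` all yield `g`, while `https://example.org/g&x` is rejected.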
Mike Fährmann
0ab9bb1721 [4chan] add extractor for entire boards (closes #510) 2019-12-09 20:36:05 +01:00
Mike Fährmann
64786363be [4chan] simplify
- remove 'chan.py'
- slight adjustments to directory and filenames
2019-11-02 20:11:21 +01:00
Mike Fährmann
6284731107 simplify extractor constants
- single strings for URL patterns
- tuples instead of lists for 'directory_fmt' and 'test'
- single-tuple tests where applicable
2019-02-08 13:45:40 +01:00
Mike Fährmann
9e188f6a21 [4chan] support 4channel.org domain 2018-11-21 17:40:38 +01:00
Mike Fährmann
f10bd5cdbe [4chan] unescape filenames 2018-06-12 23:19:38 +02:00
Mike Fährmann
2ef3c35c98 smaller textual changes
- swapped doc for deviantart.mature and .original
- updated gallery-dl.conf
- "transferred" -> "delegated"
2017-10-09 23:23:19 +02:00
Mike Fährmann
872b8aed97 [4chan] update test 2017-03-03 15:50:10 +01:00
Mike Fährmann
94e10f249a code adjustments according to pep8 nr2 2017-02-01 00:53:19 +01:00
Mike Fährmann
f7c993d952 set 'extension' keyword whenever possible 2016-10-07 15:11:58 +02:00
Mike Fährmann
c20a3b3a01 [*chan] code cleanup 2016-10-03 08:23:40 +02:00
Mike Fährmann
56d810c896 update keyword hashes for tests 2016-09-25 17:28:46 +02:00
Mike Fährmann
49a05c32ed add missing tests 2016-09-19 16:15:27 +02:00
Mike Fährmann
d7e168799d consistent extractor naming scheme + docstrings 2016-09-12 10:34:31 +02:00
Mike Fährmann
2dfe97dd00 [chan] update to new format 2015-11-21 03:13:06 +01:00
Mike Fährmann
3c13548f29 rewrite extractors to use config-module 2015-10-05 15:51:08 +02:00
Mike Fährmann
d8ef128e74 [4chan] update default filename and directory 2015-09-07 16:24:16 +02:00
Mike Fährmann
c9ef181b3c [4chan] use api 2015-06-29 23:14:35 +02:00
Mike Fährmann
5993177ae3 [4chan] more metadata entries (size, width, height) 2015-05-04 14:57:47 +02:00
Mike Fährmann
07f23dc2fa add extractor '4chan' 2015-04-14 15:04:07 +02:00