Mike Fährmann
c38856bd3f
[dt] use 'parse_datetime_iso()' for ISO formats
2025-10-19 21:52:05 +02:00
Mike Fährmann
085616e0a8
[dt] replace 'text.parse_datetime()' & 'text.parse_timestamp()'
2025-10-17 17:43:06 +02:00
Mike Fährmann
096bc6f784
replace more '.format(…)' calls with f-strings (#7671)
2025-07-10 21:17:17 +02:00
Mike Fährmann
e08ec7e083
update copyright notices
2025-06-13 00:03:41 +02:00
Mike Fährmann
76040f9d68
[issuu] fix extractors (#7317)
2025-04-08 17:03:27 +02:00
Mike Fährmann
5807daa19a
[issuu] unescape HTML entities
2025-02-02 18:33:18 +01:00
Mike Fährmann
1b5e0c0e87
[issuu] fix 'user' extractor
2025-01-27 21:56:11 +01:00
Mike Fährmann
9599151118
[issuu] fix extraction
2024-01-20 16:44:48 +01:00
Mike Fährmann
a453335a9f
remove test results in extractor modules
and add generic example URLs
2023-09-11 16:30:55 +02:00
ClosedPort22
6dc8be5e48
[issuu] fix extraction
2023-08-13 21:13:50 +08:00
Mike Fährmann
dd884b02ee
replace json.loads with direct calls to JSONDecoder.decode
2023-02-09 15:22:00 +01:00
Mike Fährmann
b0cb4a1b9c
replace 'text.extract()' with 'text.extr()' where possible
2022-11-05 01:14:09 +01:00
Mike Fährmann
0794027100
[issuu] fix extraction (#2483)
2022-04-10 14:23:10 +02:00
Mike Fährmann
bd08ee2859
remove most 'yield Message.Version' statements
only leave them in oauth.py as noop results
2021-08-16 03:10:48 +02:00
Mike Fährmann
968d3e8465
remove '&' from URL patterns
'/?&#' -> '/?#' and '?&#' -> '?#'
According to https://www.ietf.org/rfc/rfc3986.txt, URLs are
"organized hierarchically" by using "the slash ("/"), question
mark ("?"), and number sign ("#") characters to delimit components"
2020-10-22 23:31:25 +02:00
Mike Fährmann
4e361b3008
add tests for specific datetime values
2020-02-23 16:48:30 +01:00
Mike Fährmann
e89413da22
update test results
2020-01-12 21:41:06 +01:00
Mike Fährmann
702f2fbd1f
[issuu] add publication and user extractors (#413)
2019-10-31 16:52:57 +01:00