Mike Fährmann
|
b376fa814e
|
[zerochan] handle "KeyError - 'items'" (#5826)
Zerochan sometimes sends an empty response when there are no more
accessible posts to be had.
|
2024-07-05 21:34:33 +02:00 |
|
Mike Fährmann
|
cc6b9e4c18
|
[zerochan] use API by default (#3669)
add 'pagination' option
|
2024-02-25 00:36:14 +01:00 |
|
Mike Fährmann
|
42335ea880
|
[zerochan] fix skipping every other post
|
2024-02-15 02:51:01 +01:00 |
|
Mike Fährmann
|
adc3aa0b77
|
[zerochan] fix metadata extraction
author, path, tags
|
2023-11-24 21:21:14 +01:00 |
|
Mike Fährmann
|
a453335a9f
|
remove test results in extractor modules
and add generic example URLs
|
2023-09-11 16:30:55 +02:00 |
|
Mike Fährmann
|
d97b8c2fba
|
consistent cookie-related names
- rename every cookie variable or method to 'cookies_*'
- simplify '.session.cookies' to just '.cookies'
- more consistent 'login()' structure
|
2023-07-22 01:20:50 +02:00 |
|
enduser420
|
d52ed2bc5a
|
[zerochan] fix 'tags' extraction
|
2023-07-18 16:38:04 +05:30 |
|
Mike Fährmann
|
ed2d715019
|
fix 'keywords' in extractor tests (#3491)
|
2023-01-03 15:14:23 +01:00 |
|
Mike Fährmann
|
4063563cd7
|
[zerochan] update for layout v3
- remove cookie disabling v3
- fix and improve metadata extraction
|
2022-12-17 12:51:51 +01:00 |
|
Mike Fährmann
|
b0cb4a1b9c
|
replace 'text.extract()' with 'text.extr()' where possible
|
2022-11-05 01:14:09 +01:00 |
|
Mike Fährmann
|
3cb8327c60
|
[zerochan] add 'metadata' option (#2861)
|
2022-09-02 23:25:19 +02:00 |
|
Mike Fährmann
|
21ff77fea0
|
[zerochan] extract more metadata for single posts
Neither HTML pages nor RSS feed entries have *all* metadata.
It might be necessary to do 1-2 extra HTTP requests to grab everything.
|
2022-08-14 17:26:29 +02:00 |
|
Mike Fährmann
|
98af5a0409
|
[zerochan] implement login with username & password (#1434)
|
2022-07-29 12:56:20 +02:00 |
|
Mike Fährmann
|
3a8addfe45
|
[zerochan] add 'tag' and 'image' extractors (#1434)
|
2022-07-27 22:58:23 +02:00 |
|