Commit Graph

12 Commits

Author SHA1 Message Date
Mike Fährmann
86a396e086 [bcy] fix JSONDecodeError (#3321) 2022-11-30 14:44:19 +01:00
Mike Fährmann
b0cb4a1b9c replace 'text.extract()' with 'text.extr()' where possible 2022-11-05 01:14:09 +01:00
Mike Fährmann
46b64251eb [bcy] fix extraction (#3103)
- fix regex for non-watermarked images
- fetch data from '/item/detail' pages for all other posts,
  since '/apiv3/user/selfPosts' only has incomplete data
2022-10-26 15:33:07 +02:00
Mike Fährmann
211de95dd0 update extractor test results 2021-11-01 02:58:53 +01:00
Mike Fährmann
21c2da454f update extractor test results 2021-07-04 22:00:32 +02:00
Mike Fährmann
ba8180b5e6 [bcy] don't crash with deleted posts 2021-04-25 18:51:09 +02:00
Mike Fährmann
058cc47e9b [bcy] improve pagination 2021-03-28 23:08:26 +02:00
Mike Fährmann
18213dc5ba release version 1.15.2 2020-10-24 18:57:29 +02:00
Mike Fährmann
3fb41c34c8 [bcy] reduce requests to '/item/detail/<id>' (#613)
The former implementation would try to use the embedded data from
'/item/detail/' pages for every post, even if that wasn't really
necessary.

This commit also fixes some issues with posts only visible to
logged in users.
2020-03-04 01:37:51 +01:00
Mike Fährmann
8fbbaa54ff [bcy] fix partial image URLs (#613)
Images from new posts can have incomplete/partial URLs (1)
without any filename extension when fetching their data from
'/apiv3/user/selfPosts', so now all data gets taken from
'/item/detail/ID' pages.

It is currently unknown how to get the non-watermarked original version
of these images, or if that is possible at all. (2)
Images with a watermark will have their 'filter' metadata field set to
"watermark". For original images this field is an empty string "".

Enabling the 'noop' option will, in addition to the watermarked version,
yield the the '~noop.image' filter version (3),
where 'filter' is set to "noop".

(1) "https://img-bcy-qn.pstatp.com/banciyuan/3ccdff22479c4060aadc86718209b281"
(2) "https://p1-bcy.byteimg.com/img/banciyuan/3ccdff22479c4060aadc86718209b281~tplv-banciyuan-logo-v3:wqnpnLLlhZLlpKfprZTnjotfCuWNiuasoeWFgyAtIEFDR-eIseWlveiAheekvuWMug==.image"
(3) "https://p1-bcy.byteimg.com/img/banciyuan/3ccdff22479c4060aadc86718209b281~noop.image"
2020-02-28 22:57:10 +01:00
Mike Fährmann
4e361b3008 add tests for specific datetime values 2020-02-23 16:48:30 +01:00
Mike Fährmann
719b63d0ca [bcy] add user and post extractors (#592) 2020-02-09 02:37:14 +01:00