Commit Graph

62 Commits

Author SHA1 Message Date
Mike Fährmann
67c99b1366 [patreon] prevent HttpError for stream.mux.com URLs 2024-01-21 22:50:40 +01:00
Mike Fährmann
2dcfb012ea [patreon] download 'm3u8' manifests with ytdl 2024-01-12 02:33:27 +01:00
Mike Fährmann
1c68b7df01 [patreon] fix KeyError (#5048) 2024-01-11 17:56:47 +01:00
Mike Fährmann
6cd5e6adad [patreon] fix bootstrap data extraction (#4904) 2023-12-15 18:24:52 +01:00
Tobi823
66cbe9da41 - fix style check failure "line to long" 2023-12-10 16:24:00 +01:00
Tobi823
244444b194 - adapt code to current code style 2023-12-10 16:22:32 +01:00
Tobi823
fd06255f93 - reformat and refactor to pass tests 2023-12-10 16:17:34 +01:00
Tobi823
5ff7106d4f - add code for the situation when Patreon is using window.patreon = wrapInProxy({"bootstrap":' to store metadata
- refactor code to make it more readable
- output page content when the HTML structure is unknown (to make debugging easier)
2023-12-10 16:10:46 +01:00
Mike Fährmann
007c433677 [patreon] support 'id:<campaign_id>' in place of a user name
https://patreon.com/id:12345
… and remove 'campaign-id' config option
2023-11-04 00:17:41 +01:00
Mike Fährmann
4730de163f [patreon] refactor _extract_bootstrap() 2023-10-29 15:57:21 +01:00
Tobias Hellmann
28ada11cba Try to parse newer HTTP response from Patreon 2023-10-28 22:27:26 +02:00
Mike Fährmann
12a800ce21 [patreon] improve 'campaign_id' handling (#4699, #4715)
- add ways to directly specify a 'campaign_id'
  - 'campaign-id' config option
  - 'c' or 'campaign_id' URL query parameter
- more descriptive error messages
- show 'campaign_id' value in debug log
2023-10-25 17:18:06 +02:00
Mike Fährmann
63db54b905 [patreon] update 'campaign_id' path (#4639) 2023-10-09 09:41:38 +02:00
Mike Fährmann
7592c5e566 [patreon] fix extraction (#4547) 2023-09-19 13:18:47 +02:00
Mike Fährmann
3ecb512722 send Referer headers by default 2023-09-19 00:02:04 +02:00
Mike Fährmann
a453335a9f remove test results in extractor modules
and add generic example URLs
2023-09-11 16:30:55 +02:00
Mike Fährmann
d97b8c2fba consistent cookie-related names
- rename every cookie variable or method to 'cookies_*'
- simplify '.session.cookies' to just '.cookies'
- more consistent 'login()' structure
2023-07-22 01:20:50 +02:00
Mike Fährmann
dd884b02ee replace json.loads with direct calls to JSONDecoder.decode 2023-02-09 15:22:00 +01:00
Mike Fährmann
88610c3478 [patreon] update API query parameters 2022-12-01 10:02:28 +01:00
Mike Fährmann
c19b1f03b9 [patreon] fix '403 Forbidden' errors
send 'Content-Type' headers for API requests
2022-12-01 09:52:36 +01:00
Mike Fährmann
5b9a22af7f [patreon] improve 'campaign_id' extraction (#3235) 2022-11-30 11:57:52 +01:00
Mike Fährmann
b0cb4a1b9c replace 'text.extract()' with 'text.extr()' where possible 2022-11-05 01:14:09 +01:00
Mike Fährmann
bfa5e61900 [patreon] add explicit 'image_large' file type (#2257)
to allow more control over when and if to download 'large_url' images

4fee3a0e52 forced them to be downloaded
instead of regular images, even though 'large_url' images are most likely
an upscaled version of the original.
2022-03-06 17:07:13 +01:00
Mike Fährmann
4fee3a0e52 [patreon] download 'large_url' images if available (#2257) 2022-02-17 18:23:59 +01:00
Mike Fährmann
7e6981dda6 rename 'disabletls12' to 'tls12'
and let config options override any default settings
2022-02-01 01:37:03 +01:00
Robert Pendell
4c651f6252 [patreon] Disable TLS 1.2 by default (#2249)
Disables TLS 1.2 on Patreon by default.
2022-01-30 23:30:44 +01:00
Mike Fährmann
6f2e0c9c3d fix cookie checks for patreon, fanbox, fantia
The changes in 9a255344 caused a warning about missing cookies to be
displayed even if those cookies were present, because _check_cookies()
did not account for an empty cookiedomain.
2022-01-01 17:55:58 +01:00
Mike Fährmann
9a25534490 use Extractor._check_cookies() for all cookie checks 2021-12-16 02:21:16 +01:00
Mike Fährmann
211de95dd0 update extractor test results 2021-11-01 02:58:53 +01:00
Mike Fährmann
8d676151b7 [patreon] implement 'files' option (#1935) 2021-10-17 04:17:10 +02:00
Mike Fährmann
6695ef2e10 [patreon] better filenames for 'content' images (#1954) 2021-10-16 23:21:14 +02:00
Mike Fährmann
bd08ee2859 remove most 'yield Message.Version' statements
only leave them in oauth.py as noop results
2021-08-16 03:10:48 +02:00
Mike Fährmann
24dd10ac3c [patreon] extract user defined 'tags' (#1539, closes #1540) 2021-05-18 00:35:52 +02:00
Mike Fährmann
fe2ec9cf68 [patreon] reduce redirects when fetching campaign ID 2021-02-27 16:37:41 +01:00
Mike Fährmann
29ea54dc41 [patreon] use '"browser": "firefox"' by default (#1117) 2021-02-27 16:26:42 +01:00
Mike Fährmann
fbfcbcbf57 Merge branch '1.17.0' 2021-02-15 02:32:33 +01:00
Mike Fährmann
ba693d8686 [patreon] skip posts without view permission (#1316) 2021-02-14 16:03:11 +01:00
Mike Fährmann
56a8968435 remove 'Message.Metadata' (#866) 2021-01-31 02:12:37 +01:00
Mike Fährmann
968d3e8465 remove '&' from URL patterns
'/?&#' -> '/?#' and '?&#' -> '?#'

According to https://www.ietf.org/rfc/rfc3986.txt, URLs are
"organized hierarchically" by using "the slash ("/"), question
mark ("?"), and number sign ("#") characters to delimit components"
2020-10-22 23:31:25 +02:00
Mike Fährmann
f1344fe552 [patreon] yield images and attachments before postfiles (#871)
The reported filename of the 'postfile' entry of each post may differ
from the corresponding entry in the list of images or attachments,
and be outright "wrong".
2020-07-09 00:10:26 +02:00
Mike Fährmann
d5bfb0b38c set pseudo extension for Metadata messages (#865)
This prevents pathfmt.filename from potentially being empty.
2020-07-04 22:14:39 +02:00
Mike Fährmann
6294e2c540 add 'text.ensure_http_scheme()' 2020-05-19 22:32:53 +02:00
Mike Fährmann
fc3e54275b [patreon] respect filters and sort order in query params (#711) 2020-04-28 23:58:03 +02:00
Mike Fährmann
46b9a4d8ff [patreon] improve hash extraction (#693, #713)
Instead of accessing a specific part of a download URL, potentially
causing an exception if it doesn't exist, we're now searching through
all parts for a potential MD5 hash without ever raising an exception.
2020-04-28 21:47:18 +02:00
Mike Fährmann
233b6f93a2 [patreon] recognize URLs with creator IDs (#711)
e.g. https://www.patreon.com/user/posts?u=…
2020-04-26 22:19:10 +02:00
Mike Fährmann
08674a91f3 [patreon] fix hash extraction from download URLs (closes #693)
The old method was assuming every URL path ends with '/1'. For URLs
where this is not the case, the segment containing the post ID was
used as file hash.
2020-04-15 23:28:57 +02:00
Mike Fährmann
b9c574bd1d [patreon] log skipped files (#590) 2020-02-11 19:01:07 +01:00
Mike Fährmann
109f6c8685 [patreon] filter duplicate files per post (#590) 2020-02-05 23:38:24 +01:00
Mike Fährmann
8c11e81c9f Merge commit '63e6993716db8d8bedfb7b0d445c7161493046b6' 2019-12-16 17:25:23 +01:00
Mike Fährmann
63e6993716 merge 'bypost' functionality into metadata postprocessor 2019-12-16 17:19:23 +01:00