18 Commits

Author SHA1 Message Date
Mike Fährmann
a383eca7f6 decouple extractor initialization
Introduce an 'initialize()' function that does the actual init
(session, cookies, config options) and can be called separately from
the constructor __init__().

This makes it possible, for example, to adjust config access inside
a Job before most of it has already happened, as was the case when
calling 'extractor.find()'.
2023-07-25 22:16:16 +02:00
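
A minimal sketch of the two-phase initialization this commit describes. It is not the project's actual code: the class layout, the dict-based config, and the 'retries' option are assumptions made for illustration. The constructor stays cheap, and a caller such as a Job can change the config before invoking initialize().

```python
class Extractor:
    """Hypothetical extractor with two-phase initialization."""

    def __init__(self, url):
        # The constructor only records cheap state; no config is read here.
        self.url = url
        self.session = None
        self.retries = None
        self.initialized = False

    def initialize(self, config):
        # The actual setup (session, cookies, config options) runs on demand.
        self.session = {"cookies": dict(config.get("cookies", {}))}
        self.retries = config.get("retries", 4)
        self.initialized = True


class Job:
    """Hypothetical job that adjusts the config before initialization."""

    def __init__(self, extractor, config):
        self.extractor = extractor
        self.config = dict(config)

    def run(self):
        # The override takes effect because nothing has read the config yet.
        self.config["retries"] = 1
        self.extractor.initialize(self.config)
        return self.extractor.retries
```

If the constructor had read 'retries' itself, the override in Job.run() would arrive too late to have any effect.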
Mike Fährmann
fceabee433 [philomena] use API interface class
handle 429 errors and retry after 10min (#4288)
2023-07-13 20:46:04 +02:00
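
A rough sketch of retrying after an HTTP 429 response, assuming a 10-minute (600-second) wait as stated in the commit message. The function name, the injectable 'fetch' and 'sleep' callables, and the retry limit are hypothetical and do not reflect the extractor's real API class.

```python
import time


def request_with_retry(fetch, url, wait=600, max_tries=3, sleep=time.sleep):
    """Call fetch(url) and retry after 'wait' seconds on HTTP 429.

    'fetch' is a hypothetical callable returning (status_code, body).
    """
    status, body = None, None
    for attempt in range(max_tries):
        status, body = fetch(url)
        if status != 429:
            return status, body
        # Rate limited: wait before the next attempt, unless none remain.
        if attempt < max_tries - 1:
            sleep(wait)
    return status, body
```

Passing 'sleep' as a parameter keeps the sketch testable without actually waiting ten minutes.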
Mike Fährmann
a1ffa1ff09 [philomena] fix '--range' (#4288) 2023-07-08 23:17:27 +02:00
Mike Fährmann
09fb212414 [philomena] match URLs with www subdomain 2023-01-24 22:43:24 +01:00
Mike Fährmann
775895f44b [booru] refactor 'tags' and 'notes' extraction
- move HTML request for post pages into its own function
- move gelbooru_v02.py notes extraction to gelbooru.py
  since it only works there
- clean up some code
2022-10-31 12:01:19 +01:00
Mike Fährmann
da11fb32d0 update extractor test results 2022-08-28 00:16:12 +02:00
Mike Fährmann
c6a9bab019 update extractor test results 2022-07-12 15:49:22 +02:00
Mike Fährmann
d26da3b9e5 add pre-generated 'pattern' for supported BaseExtractor sites 2022-05-09 22:20:09 +02:00
Mike Fährmann
7cb29224f0 [philomena] fix search parameter escaping (#2215)
The pluses from search terms in /tags/ URLs need to be
replaced with spaces to get accepted by Philomena.
2022-01-23 01:03:37 +01:00
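
The replacement described above can be sketched as follows. The helper name is hypothetical, and the sketch covers only the plus-to-space step plus ordinary percent-decoding; any further tag escaping the extractor performs is not shown.

```python
from urllib.parse import unquote


def tags_to_query(path_segment):
    """Turn the tag part of a /tags/ URL into a search string.

    Pluses are replaced before percent-decoding, so an encoded
    literal plus (%2B) is not mistaken for a word separator.
    """
    return unquote(path_segment.replace("+", " "))
```

For example, tags_to_query("twilight+sparkle") yields "twilight sparkle".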
Mike Fährmann
159631c808 [philomena] use a default 'filter_id' if none is given 2021-12-15 16:20:53 +01:00
Mike Fährmann
cfa4876848 [philomena] support furbooru.org (closes #1995) 2021-11-15 20:57:51 +01:00
Mike Fährmann
211de95dd0 update extractor test results 2021-11-01 02:58:53 +01:00
Mike Fährmann
c3b5c88b04 update extractor test results 2021-07-20 20:21:33 +02:00
Mike Fährmann
e60962f7e5 [philomena] improve tag escapes handling (fixes #1629) 2021-06-16 18:47:08 +02:00
Mike Fährmann
bdfcc9c4b1 update extractor test results 2021-04-18 20:28:15 +02:00
Mike Fährmann
ddd48ceee5 update extractor test results 2021-03-28 23:06:44 +02:00
Mike Fährmann
847e9b0ed7 [philomena] support post URLs without '/images/'
e.g. 'derpibooru.org/1'
2021-03-14 18:26:39 +01:00
Mike Fährmann
c485d0a956 [philomena] add generalized extractors for philomena sites
(closes #1379)
2021-03-14 17:19:57 +01:00