Commit Graph

10 Commits

Author SHA1 Message Date
Mike Fährmann
a383eca7f6 decouple extractor initialization
Introduce an 'initialize()' function that does the actual init
(session, cookies, config options) and can called separately from
the constructor __init__().

This allows, for example, to adjust config access inside a Job
before most of it already happened when calling 'extractor.find()'.
2023-07-25 22:16:16 +02:00
Mike Fährmann
c6a9bab019 update extractor test results 2022-07-12 15:49:22 +02:00
Mike Fährmann
e1f501ed14 [mangakakalot] update domain 2022-07-11 00:29:25 +02:00
Mike Fährmann
4c49174579 [mangakakalot] update domain and fix extraction 2021-10-31 02:16:13 +01:00
Mike Fährmann
0265fbda61 [mangakakalot] fix extraction 2021-01-22 17:24:18 +01:00
Mike Fährmann
8e3a324c91 [mangakakalot] ignore "Go Home" buttons in chapter pages 2020-10-23 02:33:35 +02:00
Mike Fährmann
968d3e8465 remove '&' from URL patterns
'/?&#' -> '/?#' and '?&#' -> '?#'

According to https://www.ietf.org/rfc/rfc3986.txt, URLs are
"organized hierarchically" by using "the slash ("/"), question
mark ("?"), and number sign ("#") characters to delimit components"
2020-10-22 23:31:25 +02:00
Mike Fährmann
11b744d971 [mangakakalot] improve/fix chapter extraction 2020-07-16 15:00:31 +02:00
Mike Fährmann
9cd1bc6907 [mangakakalot] update URL patterns, fix flake8 errors (#876) 2020-07-13 22:47:24 +02:00
jakem72360
7dfdcc3fbf [mangakakalot] Added extractors for MangaKakalot (#876) 2020-07-13 21:20:09 +02:00