Commit Graph

13 Commits

Author SHA1 Message Date
Mike Fährmann
0655ce1bae [mangakakalot] update domain
the old one still works, but it incurs a redirect
2023-09-19 00:18:21 +02:00
Mike Fährmann
3ecb512722 send Referer headers by default 2023-09-19 00:02:04 +02:00
Mike Fährmann
a453335a9f remove test results in extractor modules
and add generic example URLs
2023-09-11 16:30:55 +02:00
Mike Fährmann
a383eca7f6 decouple extractor initialization
Introduce an 'initialize()' function that does the actual init
(session, cookies, config options) and can called separately from
the constructor __init__().

This allows, for example, to adjust config access inside a Job
before most of it already happened when calling 'extractor.find()'.
2023-07-25 22:16:16 +02:00
Mike Fährmann
c6a9bab019 update extractor test results 2022-07-12 15:49:22 +02:00
Mike Fährmann
e1f501ed14 [mangakakalot] update domain 2022-07-11 00:29:25 +02:00
Mike Fährmann
4c49174579 [mangakakalot] update domain and fix extraction 2021-10-31 02:16:13 +01:00
Mike Fährmann
0265fbda61 [mangakakalot] fix extraction 2021-01-22 17:24:18 +01:00
Mike Fährmann
8e3a324c91 [mangakakalot] ignore "Go Home" buttons in chapter pages 2020-10-23 02:33:35 +02:00
Mike Fährmann
968d3e8465 remove '&' from URL patterns
'/?&#' -> '/?#' and '?&#' -> '?#'

According to https://www.ietf.org/rfc/rfc3986.txt, URLs are
"organized hierarchically" by using "the slash ("/"), question
mark ("?"), and number sign ("#") characters to delimit components"
2020-10-22 23:31:25 +02:00
Mike Fährmann
11b744d971 [mangakakalot] improve/fix chapter extraction 2020-07-16 15:00:31 +02:00
Mike Fährmann
9cd1bc6907 [mangakakalot] update URL patterns, fix flake8 errors (#876) 2020-07-13 22:47:24 +02:00
jakem72360
7dfdcc3fbf [mangakakalot] Added extractors for MangaKakalot (#876) 2020-07-13 21:20:09 +02:00