Commit Graph

24 Commits

Author SHA1 Message Date
Mike Fährmann
580baef72c change Chapter and MangaExtractor classes
- unify and simplify constructors
- rename get_metadata and get_images to just metadata() and images()
- rename self.url to chapter_url and manga_url
2019-02-11 18:38:47 +01:00
Mike Fährmann
4b1880fa5e propagate 'match' to base extractor constructor 2019-02-11 13:31:10 +01:00
Mike Fährmann
6284731107 simplify extractor constants
- single strings for URL patterns
- tuples instead of lists for 'directory_fmt' and 'test'
- single-tuple tests where applicable
2019-02-08 13:45:40 +01:00
Mike Fährmann
6126615698 update URLs for supportedsites.rst 2019-01-30 16:18:22 +01:00
Mike Fährmann
95392554ee use text.urljoin() 2018-04-26 17:00:26 +02:00
Mike Fährmann
cc36f88586 rename safe_int to parse_int; move parse_* to text module 2018-04-20 14:53:21 +02:00
Mike Fährmann
d122203be1 [mangastream] fix extraction 2018-02-17 22:40:16 +01:00
Mike Fährmann
3cec533c28 Merge branch 'archive' 2018-02-12 18:07:58 +01:00
Mike Fährmann
20af86b2ea add more extractor tests
for mangastream, reddit and imgur
2018-02-12 17:07:18 +01:00
Mike Fährmann
5b3c34aa96 use generic chapter-extractor in more modules 2018-02-07 12:36:39 +01:00
Mike Fährmann
34873dbd90 set 'archive_fmt' values
These are going to be used to create an unique id for each image.
2018-02-01 15:30:49 +01:00
Mike Fährmann
69cbc0619f [mangastream] fix 'next-page' URLs (fixes #49) 2017-11-04 11:50:40 +01:00
Mike Fährmann
633b376f35 improve/adjust default filename formats for manga sites 2017-10-02 19:06:24 +02:00
Mike Fährmann
9fc1d0c901 implement and use 'util.safe_int()'
same as Python's 'int()', except it doesn't raise any exceptions and
accepts a default value
2017-09-24 15:59:25 +02:00
Mike Fährmann
91ad8ac2b3 [mangastream] adapt to domain-change to readms.net 2017-04-01 21:42:36 +02:00
Mike Fährmann
94e10f249a code adjustments according to pep8 nr2 2017-02-01 00:53:19 +01:00
Mike Fährmann
19c2d4ff6f remove explicit (sub)category keywords 2016-09-25 14:22:07 +02:00
Mike Fährmann
d7e168799d consistent extractor naming scheme + docstrings 2016-09-12 10:34:31 +02:00
Mike Fährmann
221b3869cf [mangastream] adapt to new page structure 2016-07-17 20:02:25 +02:00
Mike Fährmann
b5e62ec308 [mangastream] adjust regex agein (fixes #2) 2016-04-22 18:00:58 +02:00
Mike Fährmann
7af8ae509c [mangastream] adjust regex 2016-04-20 08:28:57 +02:00
Mike Fährmann
4d56b76aa8 update all other extractors 2015-11-21 04:26:30 +01:00
Mike Fährmann
c2f0720184 code cleanup to use nameext_from_url 2015-11-16 17:32:26 +01:00
Mike Fährmann
36b376b5ba [mangastream] add extractor 2015-11-08 00:03:14 +01:00