Commit Graph

15 Commits

Author SHA1 Message Date
Mike Fährmann
968d3e8465 remove '&' from URL patterns
'/?&#' -> '/?#' and '?&#' -> '?#'

According to https://www.ietf.org/rfc/rfc3986.txt, URLs are
"organized hierarchically" by using "the slash ("/"), question
mark ("?"), and number sign ("#") characters to delimit components"
2020-10-22 23:31:25 +02:00
Mike Fährmann
846d3a2466 [sexcom] replace 404ed test 2020-05-18 19:04:51 +02:00
Mike Fährmann
32e36d8f02 [sexcom] replace tests 2020-03-17 22:47:45 +01:00
Mike Fährmann
4e361b3008 add tests for specific datetime values 2020-02-23 16:48:30 +01:00
Mike Fährmann
b38cf59711 [sexcom] fix image URLs & parse 'date' fields 2020-02-04 22:52:00 +01:00
Mike Fährmann
fca87974fe [sexcom] fix video downloads by sending specific Referer headers 2019-11-19 23:52:34 +01:00
Mike Fährmann
f15eedb634 [sexcom] set Referer header for file downloads (closes #464) 2019-11-03 13:27:58 +01:00
Mike Fährmann
4409d00141 embed error messages in StopExtraction exceptions 2019-10-28 16:39:49 +01:00
Mike Fährmann
ef17d94469 update test results 2019-10-21 21:53:21 +02:00
Mike Fährmann
23251356cb require 'extension' data for each URL (#382) 2019-08-14 20:03:03 +02:00
Mike Fährmann
fdec59f8e2 replace extractor.request() 'expect' argument
with
- 'fatal': allow 4xx status codes
- 'notfound': raise NotFoundError on 404
2019-07-05 00:42:16 +02:00
Mike Fährmann
b4da8c5a97 [sexcom] add extractor for related pins (#325) 2019-07-03 21:04:23 +02:00
Mike Fährmann
69997e92db [sexcom] skip unavailable pins (#325) 2019-07-02 22:05:54 +02:00
Mike Fährmann
0318c610dc [sexcom] add extractor for search results (#147) 2019-04-24 22:10:01 +02:00
Mike Fährmann
a247c94c34 [sexcom] add pin and board extractors (#147) 2019-04-24 22:09:19 +02:00