gallery-dl

Author	SHA1	Message	Date
Leonardo Taccari	2655a2ea02	Add support for instagram.com user profiles and pages (#134) * [instagram] Add extractor for instagram.com user profiles and pages. The extractor scrapes `instagram.com/<user>' timelines and `instagram.com/p/<shortcode>' by mimicking the behaviour of a web browser and extracting the sharedData JSON of the single pages. Please note that this means that for user timelines we also do an extra request to the `instagram.com/p/<shortcode>' page, but this permits us to have consistent (and complete) information about the media fetched. The MD5 logic used for X-Instagram-GIS was documented in <https://stackoverflow.com/questions/49786980/> * [instagram] Test for keywords, not URLs, for GraphImage and GraphSidecar. URLs returned by Instagram do not seem to be stable, so avoid testing for them and instead test for the keywords returned. * [instagram] Improve test of InstagramProfilepageExtractor. Also check the count of media returned. * [instagram] Several cleanups and improvements: - Change description, subcategories to generate a better description in docs/supportedsite.rst - Remove unneeded InstagramExtractor.__init__() - Use text.parse_int() instead of directly using int() (the former is more robust) - Use self.request().json() instead of using json.loads() on self.request().text() - Add `pattern:' to check the URLs where we do not have stable URLs. It seems that only the subdomain is not stable. Thanks to @mikf!	2018-12-09 12:52:14 +01:00
HRXN	e80ee77d71	tumblr.py: update regex for video (#133). There seems to be another sub-domain for videos: not just `vt(.media).tumblr` and `vtt(media).tumblr`, but also `ve(.media).tumblr`.	2018-12-09 09:07:46 +01:00
Mike Fährmann	9a98b6769d	use extractor.request for API calls (#130) ... at least for OAuth1.0 based APIs (flickr, smugmug, tumblr)	2018-12-04 21:29:06 +01:00
Mike Fährmann	0225d90078	add exception name and traceback for OSErrors	2018-12-04 19:24:50 +01:00
Mike Fährmann	ad2cefda6b	[tumblr] in case of exception use filename as 'hash' (#129). While a filename might not be a real 'hash', or comparable to what Tumblr usually provides, it is still better than an empty string, at least as long as "alternatives" in format strings aren't implemented.	2018-12-04 19:15:23 +01:00
Mike Fährmann	95636418ad	[tumblr] catch exception for 'hash' extraction (fixes #129)	2018-12-02 19:48:09 +01:00
Mike Fährmann	40e30694f3	[pinterest] fix pin.it redirects	2018-12-02 19:38:50 +01:00
Mike Fährmann	770200888e	[gfycat] use public API endpoint	2018-12-02 18:56:53 +01:00
Mike Fährmann	b1e22e8354	release version 1.6.1	2018-11-28 15:34:01 +01:00
Mike Fährmann	5d6e219fb2	[joyreactor] update tests	2018-11-28 14:52:19 +01:00
Mike Fährmann	c59f56fe7e	[gfycat] fix extraction: /cajax/get/<id> doesn't work anymore	2018-11-28 13:26:21 +01:00
Mike Fährmann	ba56827f36	[newgrounds] add user-, video-, image-extractors (#119)	2018-11-27 15:44:53 +01:00
Mike Fährmann	15890930ea	[mangafox] fix extraction: use mobile version since desktop version is obfuscated	2018-11-26 16:13:41 +01:00
Mike Fährmann	a4263fb253	[luscious] add extractor for search results (closes #127)	2018-11-25 18:57:51 +01:00
Mike Fährmann	fb53b5dd55	fix control+c during -j and range tests	2018-11-25 18:54:05 +01:00
Mike Fährmann	a0ae156edc	[pornreactor] add tag-, user-, post-extractors (#114)	2018-11-23 14:41:26 +01:00
Mike Fährmann	bacbc2e7bd	[joyreactor] try to prevent JsonDecodeErrors (#114)	2018-11-23 14:32:37 +01:00
Mike Fährmann	503d42a1c2	[joyreactor] add tag-, user-, post-extractors (#114)	2018-11-23 09:25:02 +01:00
Mike Fährmann	59bb434ba5	[flickr] add ability to download all albums of a user, for example with 'https://www.flickr.com/photos/shona_s/albums'	2018-11-23 09:09:37 +01:00
Mike Fährmann	13cb270326	set target directory before postprocessor init (fixes #126)	2018-11-21 22:21:26 +01:00
Mike Fährmann	9e188f6a21	[4chan] support 4channel.org domain	2018-11-21 17:40:38 +01:00
Mike Fährmann	041bd501fc	[hentaifoundry] unescape YII_CSRF_TOKEN value. This fixes the POST requests to /site/filters	2018-11-19 21:46:17 +01:00
Mike Fährmann	b828473aa3	retry HTTP requests for more exception classes	2018-11-19 15:49:13 +01:00
Mike Fährmann	c2e59b9a7d	update CHANGELOG.md [ci skip]	2018-11-18 22:33:35 +01:00
Mike Fährmann	d4b2b73bef	release version 1.6.0	2018-11-17 18:28:02 +01:00
Mike Fährmann	c47482b110	smaller changes, missing docs, etc.: make 'netrc' extractor-specific; rename 'downloader.enable' to 'enabled'; document 'downloader.ytdl.format'; consistent newlines in configuration.rst	2018-11-16 18:18:07 +01:00
Mike Fährmann	b17a5d6f3b	give downloader classes proper names	2018-11-16 14:40:05 +01:00
Mike Fährmann	3c25fa2dad	update build_testresult_db.py script	2018-11-15 22:58:14 +01:00
Mike Fährmann	7f6a0be982	adjust some tests	2018-11-15 22:50:04 +01:00
Mike Fährmann	baad7b0fa5	[twitter] unpack API responses when logged in (closes #123)	2018-11-14 11:49:35 +01:00
Mike Fährmann	3bdfc15be1	[pinterest] don't crash on pins without image info	2018-11-14 11:46:14 +01:00
Mike Fährmann	8ef84a6823	add option to enable/disable specific downloader modules ... and write URLs with no (active) downloader to unsupported-file	2018-11-13 18:06:36 +01:00
Mike Fährmann	14ee6bf611	[behance] handle external URLs with youtube-dl	2018-11-13 15:10:23 +01:00
Mike Fährmann	36425122ff	[artstation] handle external URLs with youtube-dl	2018-11-13 14:27:02 +01:00
Mike Fährmann	bd8670d925	[gfycat] extend URL pattern	2018-11-11 21:19:11 +01:00
Mike Fährmann	2fa28a2609	update default user-agent string (closes #122)	2018-11-11 10:07:10 +01:00
Mike Fährmann	7e2d6bcd62	[deviantart] fix original image downloads	2018-11-10 19:16:10 +01:00
Mike Fährmann	9e12e073ab	[2chan] fix extraction	2018-11-10 19:15:21 +01:00
Mike Fährmann	966a9ca3a0	update test results	2018-11-10 19:14:54 +01:00
Mike Fährmann	e26ba682a2	enforce utf-8 encoding for input files (#120)	2018-11-10 18:27:01 +01:00
Mike Fährmann	a36259d8f1	update setup.py: add Python version check; add classifiers; simplify sys.exit() usage	2018-10-24 14:43:37 +02:00
Mike Fährmann	fd8ed35591	[turboimagehost] fix extraction	2018-10-23 21:08:24 +02:00
Mike Fährmann	c69150f715	[imagefap] fix extraction; also adds tags to gallery-metadata and converts suitable values to int	2018-10-20 18:32:25 +02:00
Mike Fährmann	d1f3d32eec	[fallenangels] unescape chapter titles	2018-10-20 18:31:26 +02:00
Mike Fährmann	655549df7c	[downloader:ytdl] add several options. The "default" downloader options (rate, retries, timeout, verify) are mapped to corresponding youtube-dl options. downloader.ytdl.logging tells the downloader to pass youtube-dl's output to a Logger object. downloader.ytdl.raw-options allows passing arbitrary options to the YoutubeDL constructor.	2018-10-20 18:26:49 +02:00
Mike Fährmann	d3d7f01543	add 'prepare()' step for post-processors. This allows post-processors to modify the destination path before checking if a file already exists.	2018-10-18 22:32:03 +02:00
Mike Fährmann	c9861ca812	adjust message for status_code based exceptions from '5xx HTTP Error: Reason' to '5xx: Reason'. The "HTTP Error" part was in there to emulate Requests' error messages from response.raise_for_status(), but it reads a lot better without.	2018-10-18 15:09:49 +02:00
Mike Fährmann	eb1c24b286	[imagebam] detect nonexistent galleries	2018-10-17 15:21:47 +02:00
Mike Fährmann	6ed629f2b6	allow specifying number of skips before abort/exit (closes #115). In addition to 'abort' and 'exit', it is now possible to specify 'abort:N' and 'exit:N' (where N is any integer) as value for 'skip' to abort/exit after consecutively skipping N downloads.	2018-10-13 17:21:55 +02:00
Mike Fährmann	e1d306cc48	update unit test results	2018-10-13 16:54:30 +02:00
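
Note on commit 6ed629f2b6: the 'abort:N' / 'exit:N' values for 'skip' described there can be illustrated with a short Python sketch. This is not gallery-dl's actual code. The names parse_skip and SkipTracker are hypothetical, and treating a plain 'abort' or 'exit' as a limit of 1 is an assumption made only for this illustration.

```python
def parse_skip(value):
    """Split a 'skip' setting into (action, limit).

    Hypothetical helper, not part of gallery-dl.
    'abort:3' -> ('abort', 3); 'exit' -> ('exit', 1) (assumed default);
    True -> ('skip', None); False -> (None, None).
    """
    if isinstance(value, str):
        action, _, count = value.partition(":")
        if action not in ("abort", "exit"):
            raise ValueError("unsupported skip value: " + value)
        # int() raises ValueError if N is not an integer
        limit = int(count) if count else 1
        return action, limit
    return ("skip" if value else None), None


class SkipTracker:
    """Count consecutive skips and report when the limit is reached."""

    def __init__(self, value):
        self.action, self.limit = parse_skip(value)
        self.consecutive = 0

    def downloaded(self):
        # A successful download breaks the run of consecutive skips.
        self.consecutive = 0

    def skipped(self):
        # Return 'abort' or 'exit' once N consecutive skips occurred,
        # otherwise None to continue with the next file.
        self.consecutive += 1
        if self.limit is not None and self.consecutive >= self.limit:
            return self.action
        return None
```

With SkipTracker("abort:3"), two skips followed by a download reset the counter, so the 'abort' action is only reported after three skips in a row.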

1330 Commits