Mike Fährmann
15f9bb3d14
add option to disable pyOpenSSL usage (#508)
...
(pyOpenSSL is now disabled by default)
2019-12-08 21:21:00 +01:00
Mike Fährmann
c8e99e3b3b
[deviantart] fix crash on missing "token" field (#505)
2019-12-07 23:38:36 +01:00
Mike Fährmann
6ed2c7823c
[deviantart] disable original downloads if no cookies set
...
For 'deviation' and 'scraps' extractors only, since original file
downloads for those two will always fail with a 404 Not Found
when not logged in.
2019-12-07 22:39:30 +01:00
Mike Fährmann
50deab5265
[deviantart] fix URL generation from /extended_fetch results
...
(closes #505)
2019-12-07 22:07:55 +01:00
Mike Fährmann
1f209da4c0
[pixiv] match new search URLs (closes #507)
2019-12-06 21:38:38 +01:00
Mike Fährmann
e17907ee2a
change default value of 'cookies-update' to 'true'
2019-12-05 23:43:49 +01:00
Mike Fährmann
07dafad26d
[twitter] attempt to fix infinite loops (#499)
...
(Hopefully this doesn't break anything else)
2019-12-03 22:55:29 +01:00
Mike Fährmann
71acbdabf4
[2chan] fix metadata extraction
2019-12-03 17:01:11 +01:00
Mike Fährmann
c0a1241648
[livedoor] force https:// for image URLs
2019-12-03 16:14:05 +01:00
Mike Fährmann
6e23c0da09
[imgur] add extractor for subreddit links (closes #500)
2019-12-02 23:44:13 +01:00
Mike Fährmann
372ffe95ee
[oauth] adjust Flickr redirect URI (fixes #503)
...
Flickr now automatically forces https:// for all redirect URIs.
2019-12-02 22:24:20 +01:00
Mike Fährmann
004812258d
[hentaifox] fix extraction
2019-12-02 22:21:45 +01:00
Mike Fährmann
e2710702d4
fix Cloudflare bypass
2019-12-01 01:07:24 +01:00
Mike Fährmann
8759403f37
[plurk] add delay between comment requests
2019-12-01 01:03:31 +01:00
Mike Fährmann
a28552fd19
update test results
...
- hbrowse: one tag got removed
- mangoxo: gallery changed owner
- photobucket: ?, but photo still downloads
2019-11-30 23:59:32 +01:00
Mike Fährmann
dcaa3d01bd
[imagefap] adapt to new image URL format
2019-11-30 23:48:02 +01:00
Mike Fährmann
e62c209ca0
[nijie] fix 'date' parsing
2019-11-30 23:08:21 +01:00
Mike Fährmann
3bba763ab9
[twitter] improve
...
- update metadata structure
- combine all user… entries into their own dict
- let 'user' always specify the Timeline owner
- add 'author' entry that specifies the original Tweet author
- create directories per post (closes #491)
- fix username issues with /i/web/ URLs
2019-11-30 22:30:37 +01:00
Mike Fährmann
db35c3b581
[directlink] separate filenames from paths
...
With this, all default filename formats specify an '{extension}'
and PathFormat.set_extension() reliably works for all files.
2019-11-28 23:50:00 +01:00
Mike Fährmann
41a3169c67
[foolfuuka] use '{extension}' in default filename format
2019-11-28 23:12:48 +01:00
Mike Fährmann
e9aed62c91
[imgur] unescape image titles
2019-11-28 22:13:24 +01:00
Mike Fährmann
2c332edaad
[plurk] fix comment pagination
2019-11-27 19:39:56 +01:00
Mike Fährmann
a3fa45bbb1
[behance] get images from 'media_collection' modules
2019-11-27 01:04:33 +01:00
Mike Fährmann
359c3bc1c5
[deviantart] revert to getting download URLs from OAuth API
...
This commit (partially) reverts 27b5b24, 94eb7c6, and a437e78.
Download URLs from the 'extended_fetch' endpoint are now only
usable for logged in users, while those from the respective
OAuth API endpoint are working again. Everything except
scraps and direct deviation links should be fixed, and those
two categories will work with exported cookies. (#488)
TODO:
- "native" login with --username and --password
- better handling of internally stored cookies
2019-11-26 23:29:46 +01:00
Mike Fährmann
42b9633c7e
update test results
2019-11-26 23:27:15 +01:00
Mike Fährmann
b28bd1c73e
[bobx] set generated session cookie (closes #482)
...
This reverts commit 490831f and also restores original image downloads
by setting a randomly generated session cookie. No login required.
2019-11-25 20:04:11 +01:00
Mike Fährmann
ae09f87602
improve SharedConfigMixin config lookups
2019-11-25 18:31:38 +01:00
Mike Fährmann
f5604492c3
update interface of config functions
2019-11-24 00:42:28 +01:00
Mike Fährmann
4ca883c66f
[smugmug] replace test for custom URLs
...
The old one (http://www.creativedogportraits.com/) is empty and/or
no longer handled by SmugMug.
2019-11-22 23:25:55 +01:00
Mike Fährmann
d45fabb79d
match user profile handling on deviantart and newgrounds
2019-11-22 23:20:21 +01:00
Mike Fährmann
ea80dadd09
[deviantart] restore archive keys
...
Commit 9fdc5e7 changed 'username' fields to have consistent
capitalization, but that invalidated the archive keys of several
extractors where 'username' was usually lowercase.
2019-11-21 17:00:08 +01:00
Mike Fährmann
ea094692c8
[vsco] fix collection extraction (#480)
2019-11-20 22:06:23 +01:00
Mike Fährmann
490831f84a
[bobx] "fix" image download URLs
...
Access to original images got restricted to (paid) members only.
All that's publicly accessible now are essentially preview pictures.
2019-11-20 21:59:37 +01:00
Mike Fährmann
978cb03f81
update misc test results
...
- Livedoor now uses https:// for its image URLs
- Instagram image URLs got simplified
2019-11-20 21:45:48 +01:00
Mike Fährmann
fca87974fe
[sexcom] fix video downloads by sending specific Referer headers
2019-11-19 23:52:34 +01:00
Mike Fährmann
edc080468d
[instagram] make 'video_url' fields optional (fixes #479)
...
[ci skip]
2019-11-19 11:18:43 +01:00
Mike Fährmann
9fdc5e74cb
[deviantart] ensure consistent username capitalization (#455)
...
The 'username' field was capitalized in a very inconsistent manner:
Either all lowercase, or as given by the input URL, or with the
"original" capitalization, depending on the extractor used among
other things.
Now usernames use their original capitalization for all extractors.
('UserName' instead of 'username' or 'uSeRnAmE')
2019-11-18 22:09:58 +01:00
Mike Fährmann
b1f0609de5
[newgrounds] rewrite (#394)
...
- restructure extractor hierarchy
- extract more metadata
- extract videos without youtube-dl
- be more resilient to errors
TODO:
- favorites
- games, but that might be near impossible for non-flash titles
2019-11-18 21:13:33 +01:00
Mike Fährmann
3ece3976ae
[newgrounds] implement login support (#394)
2019-11-16 23:45:32 +01:00
Mike Fährmann
3a07c06865
[newgrounds] update
...
- create directory per post
- rename variables and methods
2019-11-14 23:17:14 +01:00
Mike Fährmann
5513b66eb0
[vsco] fix user profile extraction
2019-11-12 23:36:48 +01:00
Mike Fährmann
abfcb356fc
[flickr] support 3k, 4k, 5k, and 6k photo sizes (closes #472)
2019-11-10 17:52:51 +01:00
Mike Fährmann
521fcd2eb9
[imgbb] fix error in galleries without user info (closes #471)
2019-11-10 17:10:51 +01:00
Mike Fährmann
8061263d4c
[imgbb] improve pagination logic
...
- avoid unnecessary API calls for small or empty galleries
- combine duplicate code
2019-11-10 17:07:27 +01:00
Mike Fährmann
da6789b2b0
disable unique archive id checks for some tests
...
- same image twice in a livedoor blog post
- unreliable results for related pinterest items
2019-11-10 17:04:51 +01:00
Mike Fährmann
b0197098e6
[imgur] get title from webpage if missing in API response
...
(closes #467)
2019-11-07 21:10:04 +01:00
Mike Fährmann
dd5d2b2eac
[deviantart] add user profile extractor (#377, #419)
2019-11-07 18:29:49 +01:00
Mike Fährmann
a437e78620
[deviantart] minimize cookie usage during scraps extraction
...
(#445)
2019-11-05 21:55:13 +01:00
Mike Fährmann
1a197d2195
store the original cookiejar as Extractor._cookiejar
2019-11-05 21:53:22 +01:00
Mike Fährmann
de83ae4576
make 'method' argument of Extractor.request keyword-only
2019-11-05 17:28:09 +01:00