Sergey M․
41d06b0424
[extractor/common] Improve _request_webpage
...
* Do not ignore data, headers and query for Requests
* Default values for headers and query switched to dicts since these are used by urllib itself
2016-03-31 22:58:38 +06:00
Sergey M․
b22ca76204
[extractor/common] Filter out unsupported encrypted media for f4m formats ( Closes #8573 )
2016-03-27 07:42:38 +06:00
Sergey M․
19dbaeece3
Remove _sort_formats from _extract_*_formats methods
...
Now _sort_formats should be called explicitly.
_sort_formats has been added to all the necessary places in code.
Closes #8051
2016-03-27 07:03:08 +06:00
Sergey M․
15707c7e02
[compat] Add compat_urllib_parse_urlencode and eliminate encode_dict
...
encode_dict functionality has been improved and moved directly into compat_urllib_parse_urlencode
All occurrences of compat_urllib_parse.urlencode throughout the codebase have been replaced by compat_urllib_parse_urlencode
Closes #8974
2016-03-26 01:46:57 +06:00
remitamine
49dea4913b
Merge pull request #8513 from remitamine/dash-sort
...
[extractor/common] fix dash formats sorting
2016-03-15 18:39:50 +01:00
Sergey M․
0fdbb3322b
[extractor/common] Add _parse_f4m_formats routine
2016-03-13 03:16:08 +06:00
remitamine
09f572fbc0
[extractor/common] add transform_source to _download_smil and _extract_smil_formats
2016-03-11 22:37:07 +01:00
remitamine
15bf934de5
Merge pull request #8819 from remitamine/simple-webpage-requests
...
[extractor/common] simplify using data, headers and query params with _download_* methods
2016-03-11 18:19:43 +01:00
remitamine
cdfee16818
[extractor/common] add data, headers and query params to _request_webpage
2016-03-11 18:12:50 +01:00
Yen Chi Hsuan
a6c8b75904
[common] Use mimeType to determine file extensions ( #8766 )
2016-03-11 23:51:42 +08:00
Yen Chi Hsuan
64f08d4ff2
Merge pull request #8766 from yan12125/dash-detect-ext
...
Detect file extensions of DASH formats from their codecs
2016-03-11 21:40:07 +08:00
Yen Chi Hsuan
af7d5a63b2
[common] Document protocol http_dash_segments
2016-03-06 17:47:07 +08:00
Yen Chi Hsuan
2def60c5f3
[common] Use codec2ext for DASH formats ( #8764 )
2016-03-05 18:18:39 +08:00
Yen Chi Hsuan
e9c0cdd389
[jython] Introduce compat_os_name
...
os.name is always 'java' on Jython
2016-03-03 19:24:24 +08:00
Sergey M․
7bcd2830dd
[extractor/common] Document uploader_url
2016-03-02 23:31:24 +06:00
Sergey M․
2bc0c46f98
[extractor/common] Document license metafield
2016-03-02 23:06:39 +06:00
Sergey M․
d77ab8e255
Add --mark-watched feature ( Closes #5054 )
2016-03-01 01:01:33 +06:00
Sergey M․
9cdffeeb3f
[extractor/common] Clarify rationale on media playlist detection
2016-02-27 07:01:11 +06:00
Sergey M․
fbb6edd298
[extractor/common] Properly extract audio only formats in master m3u8 playlists
2016-02-27 06:48:13 +06:00
Sergey M․
f5bdb44443
[extractor/common] Add _remove_duplicate_formats
2016-02-22 01:19:39 +06:00
remitamine
cafcf657a4
add more subtitles mime types to mimetype2ext and fix the platform subtitle extraction
2016-02-20 22:02:03 +01:00
Sergey M․
611c1dd96e
[refactor] Single quotes consistency
2016-02-14 15:37:17 +06:00
Sergey M․
d800609c62
[refactor] Do not specify redundant None as second argument in dict.get()
2016-02-14 14:25:04 +06:00
Sergey M․
bb20526b64
[extractor/common] Improve base url construction
2016-02-13 00:13:56 +06:00
remitamine
c349456ef6
[extractor/common] strip http urls in smil manifest
2016-02-12 17:38:48 +01:00
remitamine
81e1c4e2fc
[extractor/common] remove duplicate rtmp formats in smil manifest
2016-02-11 17:58:48 +01:00
remitamine
dd86780596
[extractor/common] fix dash formats sorting
2016-02-11 10:55:50 +01:00
remitamine
154c209e2d
[extractor/common] improve dash format ids
2016-02-11 10:33:26 +01:00
remitamine
51e9094f4a
[extractor/common] extract youtube dash formats filesize( fixes #8480 )
2016-02-09 20:05:39 +01:00
remitamine
d413095f7e
[extractor/common] remove duplicated formats and subtiles in smil manifests
2016-02-09 17:15:41 +01:00
remitamine
6a3828fddd
[common] use float conversion instead of using division from __future__
2016-02-06 14:27:04 +01:00
remitamine
91cb6b5065
rename _parse_mpd to _parse_mpd_formats and add default value for mpd namespace
2016-02-06 14:03:48 +01:00
remitamine
0826a0b555
[common] sort dash formats
2016-02-06 06:52:48 +01:00
remitamine
255732f0d3
[common] fix segment duration calculation
2016-02-03 23:57:08 +01:00
remitamine
53c269c6fd
[common] fix media_template string formating
2016-02-03 23:54:34 +01:00
remitamine
675d001633
[common] skip drm protected dash formats
2016-02-03 18:44:43 +01:00
remitamine
d577c79632
[common] ignore ISO 639-2 generic codes
2016-02-03 13:24:07 +01:00
remitamine
f14be22816
[common] remove duplicate reference to namespace
2016-02-02 22:02:08 +01:00
remitamine
9c74423510
[common] fix media template regex
2016-02-02 18:30:31 +01:00
remitamine
1bac34556f
[common] add a generic support for mpd manifests
2016-02-02 18:09:25 +01:00
Yen Chi Hsuan
2d2fa82d17
[common] Add _extract_dash_manifest_formats
2016-01-30 22:52:23 +08:00
Yen Chi Hsuan
c94678957f
[common] Remove unused arguments
2016-01-30 22:45:16 +08:00
Yen Chi Hsuan
16f38a699f
[common] Rename to namespace
...
For consistency with _parse_smil_*
2016-01-30 22:40:56 +08:00
Yen Chi Hsuan
df374b5222
[common] Prefer the manifest than formats_dict in determining codecs
2016-01-30 21:42:27 +08:00
Yen Chi Hsuan
5ea1eb78f5
[common] Fix for youtube
2016-01-30 21:36:01 +08:00
Yen Chi Hsuan
b323e1707d
[common] Modify _parse_dash_manifest for use in Facebook
2016-01-30 21:27:43 +08:00
Yen Chi Hsuan
17b598d30c
[common] _parse_dash_manifest() from youtube.py
2016-01-30 21:05:55 +08:00
Sergey M․
350cf045d8
[extractor/common] Restrict checks when auto calculating tbr
2016-01-30 01:47:46 +06:00
remitamine
a9d5f12fec
Merge pull request #8328 from remitamine/hls-master-detect
...
[extractor/common] detect media playlist in _extract_m3u8_formats
2016-01-27 18:07:30 +01:00
remitamine
7f32e5dc35
[extractor/common] detect media playlist in _extract_m3u8_formats
2016-01-27 17:53:42 +01:00
Sergey M․
b0d21deda9
[extractor/common] Auto calculate tbr when missing
2016-01-27 21:11:17 +06:00
Yen Chi Hsuan
77f785076f
[common] Keep full codec name from m3u8 manifests
...
See #8293 . This is for consistency between YouTube and HLS formats.
2016-01-25 01:03:46 +08:00
Yen Chi Hsuan
0b26ba3fc8
[extractor/common] Allow passing more parameters to _search_json_ld
2016-01-16 20:45:36 +08:00
Sergey M․
4ca2a3cf3c
[extractor/common] Add initial support for JSON-LD metadata extraction into info_dict
2016-01-16 00:36:02 +06:00
Jakub Wilk
dfb1b1468c
Fix typos
...
Closes #8200 .
2016-01-10 17:24:28 +01:00
Sergey M
3f3343cd3e
Merge pull request #8061 from dstftw/introduce-chapter-and-series-fields
...
Introduce chapter and series fields
2016-01-03 03:54:22 +06:00
Sergey M․
27bfd4e526
[extractor/common] Introduce number fields for chapters and series
2016-01-01 20:26:56 +06:00
Philipp Hagemeister
32f9036447
[ccc] Add language information to formats
2016-01-01 13:28:45 +01:00
Sergey M․
7109903e61
[extractor/common] Document chapter and series fields
2015-12-31 03:10:44 +06:00
Sergey M․
7e5edcfd33
Simplify formats accumulation for f4m/m3u8/smil formats
...
Now all _extract_*_formats routines return a list
2015-12-29 00:58:24 +06:00
remitamine
39d60b715a
Merge pull request #7769 from remitamine/sort
...
[common] lower (m3u8,rtmp,rtsp) format preference only if required program is not available
2015-12-28 19:15:14 +01:00
remitamine
d497a201ca
[common] use specific variable for protocol preference in _sort_formats
2015-12-28 18:45:01 +01:00
remitamine
8d29e47f54
[common] simplify the use of _extract_m3u8_formats and _extract_f4m_formats
2015-12-27 15:33:39 +01:00
Sergey M․
9b9c5355e4
Rename error_to_str to error_to_compat_str
2015-12-20 07:00:39 +06:00
Sergey M․
7f8b271465
Properly convert errors to strings
2015-12-20 05:27:38 +06:00
Sergey M․
dd85e4d707
[extractor/common] Properly decode error string on python 2 ( Closes #1354 , closes #3957 , closes #4037 , closes #6449 )
2015-12-20 02:43:50 +06:00
Sergey M․
62d231c004
[extractor/common] Clarify duration can be float
2015-12-03 20:55:02 +06:00
Sergey M?
5c2266df4b
Switch codebase to use sanitized_Request instead of
...
compat_urllib_request.Request
[downloader/dash] Use sanitized_Request
[downloader/http] Use sanitized_Request
[atresplayer] Use sanitized_Request
[bambuser] Use sanitized_Request
[bliptv] Use sanitized_Request
[brightcove] Use sanitized_Request
[cbs] Use sanitized_Request
[ceskatelevize] Use sanitized_Request
[collegerama] Use sanitized_Request
[extractor/common] Use sanitized_Request
[crunchyroll] Use sanitized_Request
[dailymotion] Use sanitized_Request
[dcn] Use sanitized_Request
[dramafever] Use sanitized_Request
[dumpert] Use sanitized_Request
[eitb] Use sanitized_Request
[escapist] Use sanitized_Request
[everyonesmixtape] Use sanitized_Request
[extremetube] Use sanitized_Request
[facebook] Use sanitized_Request
[fc2] Use sanitized_Request
[flickr] Use sanitized_Request
[4tube] Use sanitized_Request
[gdcvault] Use sanitized_Request
[extractor/generic] Use sanitized_Request
[hearthisat] Use sanitized_Request
[hotnewhiphop] Use sanitized_Request
[hypem] Use sanitized_Request
[iprima] Use sanitized_Request
[ivi] Use sanitized_Request
[keezmovies] Use sanitized_Request
[letv] Use sanitized_Request
[lynda] Use sanitized_Request
[metacafe] Use sanitized_Request
[minhateca] Use sanitized_Request
[miomio] Use sanitized_Request
[meovideo] Use sanitized_Request
[mofosex] Use sanitized_Request
[moniker] Use sanitized_Request
[mooshare] Use sanitized_Request
[movieclips] Use sanitized_Request
[mtv] Use sanitized_Request
[myvideo] Use sanitized_Request
[neteasemusic] Use sanitized_Request
[nfb] Use sanitized_Request
[niconico] Use sanitized_Request
[noco] Use sanitized_Request
[nosvideo] Use sanitized_Request
[novamov] Use sanitized_Request
[nowness] Use sanitized_Request
[nuvid] Use sanitized_Request
[played] Use sanitized_Request
[pluralsight] Use sanitized_Request
[pornhub] Use sanitized_Request
[pornotube] Use sanitized_Request
[primesharetv] Use sanitized_Request
[promptfile] Use sanitized_Request
[qqmusic] Use sanitized_Request
[rtve] Use sanitized_Request
[safari] Use sanitized_Request
[sandia] Use sanitized_Request
[shared] Use sanitized_Request
[sharesix] Use sanitized_Request
[sina] Use sanitized_Request
[smotri] Use sanitized_Request
[sohu] Use sanitized_Request
[spankwire] Use sanitized_Request
[sportdeutschland] Use sanitized_Request
[streamcloud] Use sanitized_Request
[streamcz] Use sanitized_Request
[tapely] Use sanitized_Request
[tube8] Use sanitized_Request
[tubitv] Use sanitized_Request
[twitch] Use sanitized_Request
[twitter] Use sanitized_Request
[udemy] Use sanitized_Request
[vbox7] Use sanitized_Request
[veoh] Use sanitized_Request
[vessel] Use sanitized_Request
[vevo] Use sanitized_Request
[viddler] Use sanitized_Request
[videomega] Use sanitized_Request
[viewvster] Use sanitized_Request
[viki] Use sanitized_Request
[vk] Use sanitized_Request
[vodlocker] Use sanitized_Request
[voicerepublic] Use sanitized_Request
[wistia] Use sanitized_Request
[xfileshare] Use sanitized_Request
[xtube] Use sanitized_Request
[xvideos] Use sanitized_Request
[yandexmusic] Use sanitized_Request
[youku] Use sanitized_Request
[youporn] Use sanitized_Request
[youtube] Use sanitized_Request
[patreon] Use sanitized_Request
[extractor/common] Remove unused import
[nfb] PEP 8
2015-11-23 21:56:23 +06:00
Sergey M․
019839faaa
[extractor/common] Use baseURL from f4m manifest for recursive manifest extraction
2015-11-21 18:01:39 +06:00
Sergey M
30eecc6a04
Merge pull request #7296 from jaimeMF/xml_attrib_unicode
...
Use a wrapper around xml.etree.ElementTree.fromstring in python 2.x (…
2015-10-31 18:15:21 +00:00
Sergey M․
dbd82a1d4f
[extractor/common] Fix m3u8 extraction on failure
2015-11-01 00:01:34 +06:00
Sergey M․
dc519b5421
[extractor/common] Make ie_key and IE_NAME return unicode string
2015-10-31 23:12:57 +06:00
Jaime Marquínez Ferrándiz
36e6f62cd0
Use a wrapper around xml.etree.ElementTree.fromstring in python 2.x ( #7178 )
...
Attributes aren't unicode objects, so they couldn't be directly used in info_dict fields (for example '--write-description' doesn't work with bytes).
2015-10-25 20:13:16 +01:00
remitamine
3711304510
[extractor/common] get the redirected m3u8_url in _extract_m3u8_formats
2015-10-24 19:01:54 +06:00
Jaime Marquínez Ferrándiz
865d1fbafc
[extractor/common] Remove unused import
2015-10-24 12:39:23 +02:00
Sergey M․
943a1e24b8
[extractor/common] Use more generic URLError in _is_valid_url
2015-10-24 16:25:04 +06:00
Sergey M․
02835c6bf4
[extractor/common] Document repost_count
2015-10-18 09:34:54 +06:00
Sergey M․
448ef1f31c
[extractor/common] Allow angle brackets in attributes in _og_regexes ( #7215 )
2015-10-18 09:11:02 +06:00
Sergey M․
7a6d76a64d
[extractor/common] Require closing quote in _og_regexes ( Closes #7174 )
...
E.g. do not match `property='og:video:type'` when `og:video` is requested.
2015-10-14 20:49:39 +06:00
Sergey M․
4180a3d8b7
[extractor/common] Allow quoteless content attribute in og regexes ( Closes #7115 )
2015-10-10 01:46:01 +06:00
Yen Chi Hsuan
57935b2564
[extractor/common] Allow HTML5 unquoted attribute values
...
Fixes #7108
HTML5 allows unquoted attribute values. See the "Unquoted attribute value
syntax" section [1] for more information
[1] http://www.w3.org/TR/html5/syntax.html
2015-10-09 14:11:00 +08:00
Sergey M․
4bba371644
[YoutubeDL] Autocalculate ext for subtitles when missing
2015-10-04 20:42:26 +06:00
Sergey M․
e5851b963a
[extractor/common] Make f4m extraction for SMIL non fatal
2015-10-01 23:04:56 +06:00
Sergey M․
4de6131090
[extractor/common] Add fatal to _extract_f4m_formats
2015-10-01 23:03:31 +06:00
Sergey M․
3a1341a7bc
[extractor/common] Make m3u8 extraction for SMIL non fatal
2015-10-01 22:59:20 +06:00
Sergey M․
c78e48177c
[extractor/common] Check validity of direct URLs
2015-10-01 22:54:54 +06:00
Sergey M․
647eab4541
[extractor/common] Extract upload date from SMIL
2015-10-01 22:20:28 +06:00
Sergey M․
1e5bcdec02
[extractor/common] Extract images from SMIL
2015-10-01 22:20:21 +06:00
Sergey M․
e7d8e98a9f
[extractor/common] Allow float bitrates
2015-10-01 22:20:15 +06:00
Sergey M․
8aab976bbd
[extractor/common] Document release_date field
2015-09-26 21:07:54 +06:00
Sergey M․
c430802e32
[extractor/common] Add raise_geo_restricted
2015-09-22 21:50:20 +06:00
Sergey M․
586f1cc532
[extractor/common] Skip html comment tags ( Closes #6822 )
2015-09-11 21:07:32 +06:00
Sergey M․
73eb13dfc7
[extractor/common] Case insensitive inputs extraction
2015-09-11 20:43:05 +06:00
Sergey M․
be0e5dbd83
[extractor/common] Extract submit inputs
2015-09-06 07:20:47 +06:00
Sergey M․
43e7d3c945
[extractor/common] Add raise_login_required
2015-08-26 21:24:47 +06:00
Jaime Marquínez Ferrándiz
8c97f81943
[common] Follow convention of using 'cls' in classmethods
2015-08-21 11:35:51 +02:00
Yen Chi Hsuan
f738dd7b7c
[common] Remove debugging codes
2015-08-21 01:43:22 +08:00
Yen Chi Hsuan
912e0b7e46
[common] Add _merge_subtitles()
2015-08-21 01:37:07 +08:00
Yen Chi Hsuan
03bc7237ad
[common] _parse_smil_subtitles: accept lang
as the subtitle language
2015-08-20 23:18:58 +08:00
Sergey M․
5cdefc4625
[extractor/common] Add more subtitle mime types for guess when ext is missing
2015-08-20 01:02:50 +06:00