Skip to content

Some image links are not extracted #388

@NicolasSandoval

Description

@NicolasSandoval

Description

Image links like maps with PNG format are left out.

2021-04-17 17:35:45,509  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/ImageMap/resources/desc-20.png?15600
2021-04-17 17:35:48,371  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/ImageMap/resources/desc-20.png?15600
2021-04-17 17:35:49,997  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/ImageMap/resources/desc-20.png?15600
2021-04-17 17:35:53,665  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,11,40.4308,-3.7031,266x250.png?lang=es'
2021-04-17 17:35:54,902  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,9,40.67,-73.94,275x250.png?lang=es'
2021-04-17 17:35:55,537  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,10,-34.599722222222,-58.381944444444,300x300.png?lang=es&domain=es.wikipedia.org&title=Buenos+Aires&groups=_1f2405ce1888c1041823dcda962c8595b0609749'
2021-04-17 17:35:55,890  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/ImageMap/resources/desc-20.png?15600
2021-04-17 17:35:56,701  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/ImageMap/resources/desc-20.png?15600
2021-04-17 17:35:56,702  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/ImageMap/resources/desc-20.png?15600
2021-04-17 17:35:57,699  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/ImageMap/resources/desc-20.png?15600
2021-04-17 17:35:59,303  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,11,48.8565,2.345,266x266.png?lang=es'
2021-04-17 17:36:02,625  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,11,35.6879633,139.7690085,275x250.png?lang=es'
2021-04-17 17:36:05,072  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/ImageMap/resources/desc-20.png?15600
2021-04-17 17:36:06,057  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,11,41.388,2.168,266x250.png?lang=es'
2021-04-17 17:36:09,118  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,10,-33.44889,-70.669265,275x250.png?lang=es'
2021-04-17 17:36:11,183  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,12,16.753056,-93.115556,255x255.png?lang=es'
2021-04-17 17:36:14,544  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,11,25.6715,-100.3085,200x200.png?lang=es'

and

2021-04-20 14:04:50,462  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/wikihiero/img/hiero_G1.png?4d556

Possible Solution

Links that not start with maps are excluded by ImageParser.replace of the extract.py module, perhaps such links could be included.

Another point to consider is why some links do not pass if they have the ? symbol without having lang after.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions