extract more useful images #515
Comments
Hope to have a look at this before the next extraction.
I was told to have a look at the ImageExtractor for this issue. Surprisingly, the problems are not caused by the ImageExtractor, because it extracts images to
@Termilion Could I take a look at the images for
You're right. Checked:
You've done a lot more than just images from infoboxes!
Excellent work @Termilion, and worth presenting at the Semantics 2017 DBpedia day, if you're going there. I wonder how this harvest from the page compares to Commons lists: This is extracted as RDF:
I'd say keep them. @chile12 and @jimkont, how to approach dbpedia/ontology-tracker#19? Maybe you can add it as an item for the meeting? (I won't be there).
Thanks @VladimirAlexiev, but sadly I won't be able to go to Semantics this year.
This should now be ready to be merged after the properties are updated.
@VladimirAlexiev, we recently introduced a testing methodology; see our submission for Semantics: https://svn.aksw.org/papers/2020/semantics_marvin/public.pdf Most of these issues can be captured there. My question is: is there something from this thread that we can define as a test? cc @Vehnem
This bug is about extracting more useful images from Wikipedia. This PR is related: #470.
The Berlin page as it was extracted in dbpedia has a number of images on wikipedia:
These are extracted as follows http://dbpedia.org/page/Berlin:
Bugs:
multiple image template extractor should extract filenames fully (not stop at digits) and turn them into actionable links wiki-commons:Special:FilePath/*
image_photo={{Photomontage|...photo[0-9]+[a-z]*=
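To make the first two bugs concrete, here is a minimal sketch of pulling full filenames out of Photomontage-style parameters. It is not the extraction framework's code: the sample wikitext, the filenames, and the function name `extract_photo_filenames` are all made up for illustration, and only the `photo[0-9]+[a-z]*` parameter pattern comes from the issue text.

```python
import re

# Hypothetical wikitext; filenames are invented and contain digits on purpose.
SAMPLE = (
    "image_photo={{Photomontage"
    "|photo1a=Example skyline 2012.jpg"
    "|photo2a=Example gate 1.jpg"
    "|photo2b=Example3.JPG"
    "|size=280}}"
)

# Match "|photoNx=" and capture everything up to the next "|" or "}",
# so digits inside the filename do not end the match early.
PHOTO_PARAM = re.compile(r"\|\s*photo[0-9]+[a-z]*\s*=\s*([^|}]+)")


def extract_photo_filenames(wikitext):
    """Return the values of all photoNx parameters, whitespace-trimmed."""
    return [m.group(1).strip() for m in PHOTO_PARAM.finditer(wikitext)]


filenames = extract_photo_filenames(SAMPLE)
```

The capture group stops only at a parameter separator or the closing braces, which is the behaviour the "not stop at digits" bug asks for. Nested templates or links inside a parameter value would need a real wikitext parser rather than a regex.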
The Lindsay Anderson page as it was extracted in dbpedia has the following image-related info on wikipedia:
http://dbpedia.org/page/Lindsay_Anderson extracts only dbp:imagesize.
Bug: extract dbp:image and turn it into an actionable link wiki-commons:Special:FilePath/*.
Note: #133 discusses images in the en namespace vs the commons namespace, so "actionable link" may not always mean "prepend a commons namespace"; someone needs to research this.
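As a rough illustration of what an "actionable link" could look like, the sketch below turns a filename into a Special:FilePath URL. The function name `file_path_link` and the constants are hypothetical. The base URL is left as a parameter because, as noted above, whether a file lives on Commons or on the en wiki is still an open question; this code does not decide that.

```python
from urllib.parse import quote

# Assumed base URLs; which one applies to a given file is the open question from #133.
COMMONS_BASE = "http://commons.wikimedia.org/wiki/Special:FilePath/"
EN_WIKI_BASE = "http://en.wikipedia.org/wiki/Special:FilePath/"


def file_path_link(filename, base=COMMONS_BASE):
    """Build a Special:FilePath URL from a raw infobox filename value."""
    name = filename.strip()
    # Drop an optional namespace prefix such as "File:" or "Image:".
    for prefix in ("File:", "Image:"):
        if name.startswith(prefix):
            name = name[len(prefix):]
    # Wiki titles use underscores for spaces; percent-encode the rest.
    name = name.strip().replace(" ", "_")
    return base + quote(name, safe="_.-()")


link = file_path_link("Example skyline 2012.jpg")
```

Passing `base=EN_WIKI_BASE` gives the en-namespace variant for files that are not on Commons.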