Do not panic on missing .html #50
No reviewers
Labels
No labels
bug
documentation
duplicate
enhancement
good first issue
help wanted
invalid
question
wontfix
No milestone
No project
No assignees
2 participants
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference: organicmaps/wikiparser#50
Loading…
Add table
Reference in a new issue
No description provided.
Delete branch "petrzjunior/missing-html"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Fixes #49
This is an attempt to fix articles with missing
article_body.html
fields. I downloaded the Wikidata Enterprise export for several languages and I notices that some articles are indeed missing the HTML field.For example this article only has
wikitext
, but nohtml
:According to the Wikidata docs, this field is not required.
In the PR, I made the field optional and print a warning in case such article is parsed. Previously, it crashed the process.
cc @rtsisyk
Thanks for the patch! It seems to work!
So what does ultimately happen to such articles? Are they ignored/skipped?
The fix works well @rtsisyk, all data had been processed successfully.
They are not present in the final dump.
@rtsisyk merge?
View command line instructions
Checkout
From your project repository, check out a new branch and test the changes.Merge
Merge the changes and update on Forgejo.Warning: The "Autodetect manual merge" setting is not enabled for this repository, you will have to mark this pull request as manually merged afterwards.