missing field html at line 1 column 3685 #49

Open
opened 2025-01-06 19:07:04 +00:00 by rtsisyk · 0 comments
rtsisyk commented 2025-01-06 19:07:04 +00:00 (Migrated from github.com)
./run.sh -c "/home/planet/data/wikipedia/build" "/home/planet/planet/planet-latest.osm.pbf" "/home/planet/data/wikipedia/dumps/20241201/arwiki-NS0-20241201-ENTERPRISE-HTML.json.tar.gz"
2025-01-06T19:05:46Z Using maps build directory '/home/planet/data/wikipedia/build'
2025-01-06T19:05:46Z Building wikiparser
    Finished release [optimized] target(s) in 0.06s
2025-01-06T19:05:46Z Changing to maps build dir '/home/planet/data/wikipedia/build'
2025-01-06T19:05:46Z Extracting articles to '/home/planet/data/wikipedia/descriptions'
2025-01-06T19:05:46Z Extracting '/home/planet/data/wikipedia/dumps/20241201/arwiki-NS0-20241201-ENTERPRISE-HTML.json.tar.gz'
ts=2025-01-06T19:05:46.214800406Z level=info target=om_wikiparser message="om-wikiparser bab29c0-dirty"
ts=2025-01-06T19:05:46.214820143Z level=info target=om_wikiparser::get_articles span= span_path= message="Loading wikipedia/wikidata osm tags from \"osm_tags.tsv\"" pid=1183267
ts=2025-01-06T19:05:47.768026744Z level=warn target=om_wikiparser::get_articles span= span_path= message="830 errors (0.0198%) parsing osm tags from \"osm_tags.tsv\"" pid=1183267
ts=2025-01-06T19:05:47.768054657Z level=info target=om_wikiparser::get_articles span= span_path= message="Processing dump" pid=1183267
ts=2025-01-06T19:05:47.868334774Z level=info target=om_wikiparser::get_articles span=page span_path=>page message="Page without wikidata qid" pid=1183267 lang=ar title="قرية جبعة" url=https://ar.wikipedia.org/wiki/%D9%82%D8%B1%D9%8A%D8%A9_%D8%AC%D8%A8%D8%B9%D8%A9 line=255 byte=23596856
ts=2025-01-06T19:06:07.314459673Z level=error target=om_wikiparser::get_articles span=page span_path=>page message="Error processing article: page has no text after processing" pid=1183267 lang=ar title=بيلز url=https://ar.wikipedia.org/wiki/%D8%A8%D9%8A%D9%84%D8%B2 qid=Q156565 line=33868 byte=4643235306
Error: deserializing json

Caused by:
    missing field `html` at line 1 column 3685
2025-01-06T19:06:36Z ERROR: job failed with exit code 1
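
To narrow down which record triggers the failure, a small stand-alone scan of the dump may help. The following is only a diagnostic sketch, not part of wikiparser: it assumes the archive contains newline-delimited JSON files and that the missing `html` key is expected inside an `article_body` object, with the article title under `name`. Those field names are assumptions about the dump schema, and the function names `records_missing_html` and `scan_dump` are hypothetical.

```python
import json
import tarfile


def records_missing_html(lines):
    """Yield (line_number, title) for records with no article_body.html.

    Assumes one JSON object per line; `article_body`, `html` and `name`
    are assumed field names, not confirmed by the log above.
    """
    for line_number, raw in enumerate(lines, start=1):
        if not raw.strip():
            continue  # skip blank lines, but keep counting them
        record = json.loads(raw)
        body = record.get("article_body")
        if not isinstance(body, dict) or "html" not in body:
            yield line_number, record.get("name")


def scan_dump(path):
    """Stream every file in a .tar.gz dump and print the offending records."""
    with tarfile.open(path, mode="r|gz") as archive:
        for member in archive:
            stream = archive.extractfile(member)
            if stream is None:  # directories and other non-file members
                continue
            for line_number, title in records_missing_html(stream):
                print(member.name, line_number, title)
```

Calling `scan_dump` with the `arwiki-NS0-20241201-ENTERPRISE-HTML.json.tar.gz` path from the command above would list candidate records. Note that the line numbers it prints are per file inside the archive and may not match the `line=` values in the wikiparser log.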
Reference: organicmaps/wikiparser#49