Commit graph

232 commits

Author SHA1 Message Date
Bryan Housel
9b73ff5b58 Reword some of the docs, rename "entries" -> "items"
("entries" sounds more like something that a person entered)
2020-09-22 14:31:48 -04:00
Bryan Housel
1750ef5346 npm run wikidata, npm run dist 2020-09-22 12:24:35 -04:00
Bryan Housel
d0968b4f9b Include all tags in taginfo
(not just those collected from items with brand:wikidata)
2020-09-22 12:09:45 -04:00
Bryan Housel
33629a3feb Add tree param to urls in sitemap, npm run dist to regenerate 2020-09-22 11:46:42 -04:00
Bryan Housel
bea52d5257 Add ability for the build_wikidata to update en labels on Wikidata 2020-09-21 17:48:19 -04:00
Bryan Housel
c8c24692e0 Perform wikidata updates deterministically, npm run wikidata
(re: #3995)
2020-09-21 16:18:26 -04:00
Bryan Housel
d50f4e73bf Add code to build_wikidata.js to automatically update the NSI identifiers
(re: #3995)
2020-09-21 13:41:57 -04:00
Bryan Housel
063ad68e8b Adjust format of config/secrets so we can store wikibase credentials too 2020-09-19 10:48:08 -04:00
Bryan Housel
8c6fac07da Pacify eslint, npm run build 2020-09-19 10:13:52 -04:00
Bryan Housel
a7cd09df27 Same as previous, but commit the brand files too 2020-09-18 10:55:58 -04:00
Bryan Housel
7c4e631a4f Use country-coder to lookup country QID -> isoA2 code
(previously it did extra calls to the Wikidata API)
2020-09-17 11:33:04 -04:00
Bryan Housel
533ceaf341 Update the build_wikidata script, then npm run wikidata 2020-09-17 11:18:21 -04:00
Bryan Housel
43cde611d1 Update the build_dist script, then npm run dist 2020-09-17 09:32:35 -04:00
Bryan Housel
0fd3ac462a Fix validation script, fix lint warnings, remove legacy code 2020-09-16 11:31:09 -04:00
Bryan Housel
59e9cb1a55 Switch default branch from master -> main 2020-09-16 11:15:28 -04:00
Bryan Housel
8a5867d1b4 Rebuild the duplicate name check - this change makes nomatch unnecessary
Now: warn about "new" (no wikidata) entries that may duplicate an "existing" (has wikidata) item.
The criteria for this warning are:
- One of the items has no wikidata
- The items have nearly the same name
- The items have the same locationSet (or the one without wikidata is worldwide)
2020-09-15 14:48:04 -04:00
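The three criteria listed in commit 8a5867d1b4 can be sketched roughly as below. This is an illustration only, not the repository's code: the item shape (`name`, `wikidata`, `locationSetID`), the `WORLDWIDE` placeholder id, and the helper names are all hypothetical, and "nearly the same name" is approximated here by comparing names after stripping case, diacritics, and punctuation. The sketch also assumes "one of the items has no wikidata" means exactly one.

```javascript
// Hypothetical placeholder for the id of a worldwide locationSet.
const WORLDWIDE = 'world';

// Assumption: "nearly the same name" means equal after removing
// diacritics, case, and anything that is not a letter or digit.
function normalizeName(name) {
  return name
    .normalize('NFD')
    .replace(/[\u0300-\u036f]/g, '')
    .toLowerCase()
    .replace(/[^\p{L}\p{N}]/gu, '');
}

// Returns true when a "new" item (no wikidata) may duplicate an
// "existing" item (has wikidata), following the three listed criteria.
function isLikelyDuplicate(a, b) {
  const aHas = Boolean(a.wikidata);
  const bHas = Boolean(b.wikidata);

  // Criterion 1: exactly one of the items has no wikidata.
  if (aHas === bHas) return false;
  const fresh = aHas ? b : a;
  const existing = aHas ? a : b;

  // Criterion 2: the items have nearly the same name.
  if (normalizeName(fresh.name) !== normalizeName(existing.name)) return false;

  // Criterion 3: same locationSet, or the item without wikidata is worldwide.
  return fresh.locationSetID === existing.locationSetID ||
    fresh.locationSetID === WORLDWIDE;
}
```

A real implementation would likely use a fuzzier name comparison; the exact-match-after-normalization rule here is only a stand-in.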
Bryan Housel
d5bc907910 Don't report logo completeness, that information really lives in wikidata 2020-09-15 09:29:35 -04:00
Bryan Housel
a32cca4875 Add which-polygon dependency and implement location-aware matching 2020-09-14 15:27:14 -04:00
Bryan Housel
67d0e679de Implement matcher.match(k, v, n, loc) 2020-09-14 11:27:18 -04:00
Bryan Housel
67458801b1 WIP updates to build_brands script to use new matcher
And revise code to be more generic - not just for brands.
2020-09-14 09:40:19 -04:00
Bryan Housel
7f9cba5df6 Rewrite matcher to handle new cache structure and identifiers 2020-09-11 22:51:52 -04:00
Bryan Housel
67d60846bd Update build scripts to work with new cache and id
Still need to finish matcher code
2020-09-04 10:51:07 -04:00
Bryan Housel
bf9b3dc021 Settle on 2 caches by id and by path (we need both sometimes)
Restore most of the validation code, still need to redo the matcher
2020-09-03 17:13:58 -04:00
Bryan Housel
abd462c675 Settle on new entries schema
Update file_tree code used for reading/validating/writing
2020-09-02 16:57:13 -04:00
Bryan Housel
25e8926a14 WIP on stable identifier assignment 2020-08-28 16:19:23 -04:00
Bryan Housel
1863a4150a Upgrade to location-conflation v0.5.0
- This gives us *fast* locationSet id, because we can just validate
  the locationSet instead of actually resolving it to GeoJSON.
2020-08-26 17:44:31 -04:00
Bryan Housel
05237a4120 WIP on adding unique identifier to each entry
(re: #3995)

- currently using the format `simplename-hash`  (e.g. "starbucks-f83d44")
- where hash is MD5 fragment of `${tree} ${key} ${value} ${locationID}`

This should generate a reasonable identifier that stays stable until one of those changes.
Also we can eliminate disambiguators as long as same-named brands differ in one of these.
2020-08-20 18:04:56 -04:00
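The `simplename-hash` scheme described in commit 05237a4120 could look roughly like the sketch below. It is not the repository's implementation: the function names, the name-simplification rule, the six-character fragment length (inferred from the "starbucks-f83d44" example), and the sample argument values are assumptions made for illustration. Only the hashed string `${tree} ${key} ${value} ${locationID}` and the use of MD5 come from the commit message.

```javascript
const crypto = require('crypto');

// Hypothetical simplification: lowercase, keep only letters and digits.
function simplifyName(name) {
  return name.toLowerCase().replace(/[^\p{L}\p{N}]/gu, '');
}

// Build an identifier of the form `simplename-hash`, where the hash is a
// fragment of the MD5 digest of `${tree} ${key} ${value} ${locationID}`.
// The fragment length of 6 hex characters is an assumption.
function makeItemID(name, tree, key, value, locationID) {
  const message = `${tree} ${key} ${value} ${locationID}`;
  const digest = crypto.createHash('md5').update(message).digest('hex');
  return `${simplifyName(name)}-${digest.slice(0, 6)}`;
}

// Usage with made-up values; 'us' stands in for a real locationSet id.
const id = makeItemID('Starbucks', 'brands', 'amenity', 'cafe', 'us');
console.log(id);
```

Because the name is not part of the hashed string, two items with the same name get different identifiers only when the tree, key, value, or location differs, which matches the commit's note about eliminating disambiguators.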
Bryan Housel
b71c7c08ee Use JSON5 for json parsing
(closes #4085)
2020-08-18 10:36:13 -04:00
Bryan Housel
8dc83e44ea Generate a sitemap.xml
(re: #4053)
2020-08-11 09:30:57 -04:00
Bryan Housel
3f4c95d15f Add quebec.geojson
(re: #3162)
2020-06-18 14:53:01 -04:00
Bryan Housel
af2f2ae123 Support custom features under the features/* folder
(re: #3162)
2020-06-17 17:52:02 -04:00
Bryan Housel
27079c7bc1 Move the scripts into scripts/ 2020-06-17 16:00:12 -04:00