Latest Tech News

Stay updated with the latest in technology, AI, cybersecurity, and more

Extending That XOR Trick to Billions of Rows

Can we extend the XOR trick for finding one or two missing numbers in a list to finding thousands of missing IDs in a billion-row table? Yes, we can! This is possible using a data structure called an Invertible Bloom Filter (IBF), which compares two sets using a generalization of the XOR trick [1]: all the values that are identical cancel out, so the size of the data structure depends only on the size of the difference. Most explan
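For readers who have not seen the basic XOR trick that the article generalizes, here is a minimal sketch of the single-missing-value case. It is not the Invertible Bloom Filter itself, and the function name `find_missing_id` is made up for illustration.

```python
from functools import reduce
from operator import xor


def find_missing_id(expected_ids, present_ids):
    """Return the one ID that is in expected_ids but not in present_ids.

    Assumes exactly one ID is missing. Every ID that appears in both
    collections is XORed twice and cancels out, leaving only the missing one.
    """
    return reduce(xor, expected_ids, 0) ^ reduce(xor, present_ids, 0)


expected = range(1, 11)                       # IDs 1..10
present = [i for i in expected if i != 7]     # ID 7 has gone missing
print(find_missing_id(expected, present))     # prints 7
```

The accumulator is a single integer no matter how long the lists are, which is the property the article extends to larger differences.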

Corrected UTF-8 (2022)

UTF-8 is decent and all but it contains some design errors, partly because its original designers just messed up, and partly because of ISO and Unicode Consortium internal politics. We’re probably going to be using it forever so it would be good to correct these design errors before they get any more entrenched than they already have. Corrected UTF-8 is almost the same as UTF-8. We make only three changes: overlength encodings become impossible instead of just forbidden; the C1 controls and the
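To illustrate what "forbidden" means for overlength encodings in standard UTF-8 today, the sketch below feeds an overlong byte sequence to Python's built-in strict decoder. It shows current UTF-8 behavior only, not Corrected UTF-8, and the helper name `is_valid_utf8` is an assumption made for this example.

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if Python's strict UTF-8 decoder accepts the bytes."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


shortest_slash = b"/"          # U+002F in its one-byte (shortest) form
overlong_slash = b"\xc0\xaf"   # the same code point padded out to two bytes

print(is_valid_utf8(shortest_slash))   # prints True
print(is_valid_utf8(overlong_slash))   # prints False
```

The two-byte pattern is still expressible in the bit layout, so every decoder has to check for it and reject it explicitly.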

Topics: 80 bf byte sequence utf

Functions Are Vectors (2023)

Conceptualizing functions as infinite-dimensional vectors lets us apply the tools of linear algebra to a vast landscape of new problems, from image and geometry processing to curve fitting, light transport, and machine learning. Prerequisites: introductory linear algebra, introductory calculus, introductory differential equations. This article received an honorable mention in 3Blue1Brown’s Summer of Math Exposition 3!

Functions as Vectors
Vectors are often first introd
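As a rough illustration of the idea, the sketch below treats ordinary Python functions as vectors: they can be added, scaled, and compared with an inner product. The helper names (`add`, `scale`, `inner`) and the choice of a midpoint-rule integral as the inner product are assumptions for this example, not taken from the article.

```python
import math


def add(f, g):
    """Vector addition: (f + g)(x) = f(x) + g(x)."""
    return lambda x: f(x) + g(x)


def scale(c, f):
    """Scalar multiplication: (c * f)(x) = c * f(x)."""
    return lambda x: c * f(x)


def inner(f, g, a, b, n=1000):
    """Approximate the inner product, the integral of f*g over [a, b],
    using the midpoint rule with n subintervals."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) * g(a + (i + 0.5) * h) for i in range(n))


h = add(math.sin, scale(2.0, math.cos))       # the "vector" sin + 2*cos
print(h(0.0))                                 # prints 2.0

# sin and cos behave like perpendicular vectors over one full period
print(abs(inner(math.sin, math.cos, 0.0, 2 * math.pi)) < 1e-9)   # prints True
```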

Pbf2sqlite: Reading OpenStreetMap into a SQLite Database

A simple command line tool for reading OpenStreetMap .osm.pbf files into a SQLite database.

Usage:
    pbf2sqlite DATABASE [OPTION ...]

Options:
    read FILE    Reads .osm or .osm.pbf FILE into the database
    rtree        Add R*Tree indexes
    addr         Add address tables
    graph        Add graph table

The command

    pbf2sqlite test.db read country.osm.pbf

reads the OSM PBF file country.osm.pbf and creates the tables in the database test.db. OSM data can be obtained from a provider such as Geofabrik.
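The excerpt does not name the tables the tool creates. One way to find out after an import is to ask SQLite itself, as in the sketch below. It uses only Python's standard `sqlite3` module and the built-in `sqlite_master` catalog; the helper name `list_tables` is made up for illustration.

```python
import sqlite3


def list_tables(db_path: str) -> list[str]:
    """Return the names of all tables in the SQLite database at db_path."""
    conn = sqlite3.connect(db_path)
    try:
        rows = conn.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        ).fetchall()
    finally:
        conn.close()
    return [name for (name,) in rows]
```

Calling `list_tables("test.db")` after running the read command from the excerpt should list whatever tables the import produced.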

Topics: add database file osm pbf