Why This Matters
The release of the FUTO Swipe dataset marks a significant advancement in swipe typing technology, providing a large and high-quality resource for developing more accurate and efficient predictive models. This development benefits both the tech industry and consumers by enabling the creation of smarter mobile input systems that improve user experience. The open availability of the dataset encourages innovation and collaboration in the field of mobile keyboard technology.
Key Takeaways
- Provides over 1 million high-quality swipe data points for model training.
- Enhances the development of more accurate swipe typing systems.
- Open licensing fosters innovation and collaboration in mobile input technology.
In August 2024, we launched a dataset collection effort on the swipe.futo.org domain to collect QWERTY English swipes. Users would voluntarily visit the webpage on their mobile phone and be given instructions and information about the dataset. After consenting, they would be given sentences, primarily from Wikipedia, and would be asked to swipe them word-by-word.
In the end, this produced over 1 million swipes. We filtered out a small set of low-quality swipes. In March 2025, we released a dataset of 1 million swipes under the MIT license, and it is available today on HuggingFace.
We made heavy use of this data to train our models and to evaluate different swipe typing systems.