Toponymy: Topic Modelling in Embedding Space
The package name Toponymy is derived from the Greek topos ‘place’ + onuma ‘name’: thus, the naming of places. The goal of Toponymy is to put names to places in the space of information. This could be a corpus of documents, in which case Toponymy can be viewed as a topic naming library. It could also be a collection of images, in which case Toponymy could be used to name the themes of the images. The goal is to provide names that allow a user to navigate the space of information in a meaningful way.
Toponymy is designed to scale to very large corpora and collections, providing meaningful names on multiple scales, from broad themes to fine-grained topics. We make use of custom clustering methods, information extraction, and large language models to power this. The library is designed to be flexible and easy to use.
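To make "names on multiple scales" concrete, here is a toy sketch in plain Python. It is not Toponymy's API or algorithm. The one-dimensional "embeddings", the gap-based clustering in `cluster_by_gap`, and the word-count naming in `name_cluster` are all illustrative assumptions that stand in for real embeddings, the library's clustering methods, and its information extraction and language model steps.

```python
from collections import Counter

# Toy corpus: (text, position). The 1-D position is a hypothetical stand-in
# for a real, high-dimensional embedding vector.
docs = [
    ("cats pets purr", 0.10), ("cats pets nap", 0.12), ("cats mice", 0.15),
    ("dogs pets bark", 0.40), ("dogs pets fetch", 0.42), ("dogs sticks", 0.45),
    ("stocks markets rise", 0.90), ("stocks bonds", 0.95),
]

def cluster_by_gap(items, max_gap):
    """Split items, sorted by position, wherever neighbours are more than max_gap apart."""
    ordered = sorted(items, key=lambda item: item[1])
    clusters = [[ordered[0]]]
    for prev, cur in zip(ordered, ordered[1:]):
        if cur[1] - prev[1] > max_gap:
            clusters.append([])  # gap too large: start a new cluster
        clusters[-1].append(cur)
    return clusters

def name_cluster(cluster):
    """Name a cluster by its most frequent word (a crude stand-in for keyword extraction plus an LLM)."""
    counts = Counter(word for text, _ in cluster for word in text.split())
    return counts.most_common(1)[0][0]

# A small gap threshold gives fine-grained topics; a larger one gives broad themes.
fine_names = [name_cluster(c) for c in cluster_by_gap(docs, max_gap=0.1)]
broad_names = [name_cluster(c) for c in cluster_by_gap(docs, max_gap=0.3)]

print("fine-grained:", fine_names)
print("broad:", broad_names)
```

With this toy data the fine-grained pass separates cats, dogs, and stocks, while the broad pass merges the two animal groups under their shared word, pets. The same region of the space gets a different name depending on the scale at which it is viewed.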
Quick Install
pip install toponymy
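After installing, a quick check that the package is visible to your Python environment can save some confusion later. The sketch below uses only the standard library. It assumes the importable module name matches the pip package name, `toponymy`, and `is_installed` is a hypothetical helper written for this example.

```python
import importlib.util

def is_installed(module_name: str) -> bool:
    """Return True if the import system can locate a top-level module with this name."""
    return importlib.util.find_spec(module_name) is not None

# Assumption: the module name is the same as the pip package name.
print("toponymy installed:", is_installed("toponymy"))
```

If this prints `False`, the install most likely went into a different environment than the one running the script.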
Beta Status
Please note: Toponymy is currently in beta. APIs may evolve. We welcome feedback and contributions!
User Guide
Toponymy is designed to be easy to use. The user guide provides a quick start to the library and a tour of some of its richer functionality and use cases.
Getting Started:
Getting More out of Toponymy:
Sundries: