Toponymy logo

Toponymy: Topic Modelling in Embedding Space

The package name Toponymy is derived from the Greek topos ‘place’ + onuma ‘name’. Thus, the naming of places. The goal of Toponymy is to put names to places in the space of information. This could be a corpus of documents, in which case Toponymy can be viewed as a topic naming library. It could also be a collection of images, in which case Toponymy could be used to name the themes of the images. The goal is to provide a names that can allow a user to navigate through the space of information in a meaningful way.

Toponymy is designed to scale to very large corpora and collections, providing meaningful names on multiple scales, from broad themes to fine-grained topics. We make use a custom clustering methods, information extraction, and large language models to power this. The library is designed to be flexible and easy to use.

Quick Install

pip install toponymy

Beta Status

Please note: Toponymy is currently in beta. APIs may evolve. We welcome feedback and contributions!

User Guide

Toponymy is designed to be easy to use. The user guide provides a quick start to the library, and a tour of some of the richer functionality and uses cases.

Indices and tables