ExpertiseUpdated on 23 March 2026
Indexing, Searching & Similarity of data using Multimodal Models
About
Monopteryx offers a multimodal search and indexing system that connects text and images using zero-shot transfer. By leveraging pre-trained deep learning models, the system can categorize and retrieve visual data without the immediate need for extensive data labeling or training from scratch.
While fully supervised models still hold an edge in overall accuracy (as seen in the ~97% Random Forest benchmarks), our zero-shot approach provides a strong, immediate baseline—achieving 74% to 82% accuracy in Earth Observation tests—and performs on par with trained classifiers for specific categories. Built on approximate k-nearest-neighbor algorithms, the system provides a scalable, practical starting point for organizations looking to implement fast text-to-image similarity and search, with the flexibility to fine-tune the models later for higher, domain-specific precision.
Field
- Space and Aerospace
Similar opportunities
Partnership
Participation in open calls and projects as a partner
Lampros Mouselimis
Data and Remote Sensing Analyst at Monopteryx
Paramythia, Greece
Partnership
Perception, Multimodal Learning, and Reasoning
Zongru SHAO
Senior Scientist at Silicon Austria Labs GmbH
Linz, Austria