Building the core kernels of multimodal agentic AI

We design efficient, high-performance vision-language models for video summarization, document understanding, and object tracking, alongside RLVR-based reasoning models for domain-specific intelligence and agent tool use.

Our Philosophy

To build AI grounded in phronesis (practical wisdom), integrating context, ethics, and causal reasoning to enable autonomous systems that align with human values, anticipate consequences, and optimize for societal well-being.

Our state-of-the-art models

Tailor-made models for each use case and industry

Owlet-Phi-Audio

Designed for comprehensive video understanding, it excels especially in person identification, tracking, human activity recognition, and object detection.

Owlet-Safety

A document understanding model designed to parse, interpret, and analyze multilingual documents with exceptional accuracy.

Owlet

A family of lightweight, efficient models designed for advanced video understanding.

RZN-Med

A causal language model created for medical reasoning on open-ended questions.

Research Blogs

Join Us in Shaping the Future of AI

Collaborate with our research team, or apply to help drive groundbreaking advancements in AI technology.