Manifold Research Group (Page 1)

An Open-Source Software Toolkit & Benchmark Suite for the Evaluation and Adaptation of Multimodal Action Models

Pranav Guruprasad, Yangyue Wang, Sudipta Chowdhury, Jaewoo Song, Harshvardhan Sikka (2025)

Recent innovations in multimodal action models represent a promising direction for developing general-purpose agentic systems, combining visual understanding, language comprehension, and action generation. We introduce MultiNet - a novel, fully open-source benchmark and surrounding software ecosystem designed to rigorously evaluate

Benchmarking Vision, Language, & Action Models in Procedurally Generated, Open Ended Action Environments

Pranav Guruprasad, Yangyue Wang, Sudipta Chowdhury, Harshvardhan Sikka, Paul Pu Liang (2025)

Vision-language-action (VLA) models represent an important step toward general-purpose robotic systems by integrating visual perception, language understanding, and action execution. However, systematic evaluation of these models, particularly their zero-shot generalization capabilities in procedurally out-of-distribution (OOD) environments, remains limited. In this

On Orbit Object Transportation With Spacecraft Swarms

Sidhdharth D. Sikka, Zehui Lu, Ayush Rai, Daniel DeLaurentis and Shaoshuai Mou (2025)

The expanding space economy has created an urgent need for reliable on-orbit transportation systems to support both commercial and scientific missions. This paper explores a cooperative approach to orbital transportation, where a swarm of spacecraft agents, each securely attached to

Refusal in LLMs is an Affine Function

Thomas Marshall, Adam Scherlis, Nora Belrose (2024)

We propose affine concept editing (ACE) as an approach for steering language models' behavior by intervening directly in activations. We begin with an affine decomposition of model activation vectors and show that prior methods for steering model behavior correspond to

Intelligent Digital Agents in the Era of Large Language Models

B Faught, H Lu, T Marshall, H Sikka, P Guruprasad, B Gauri (2024)

Below is the abstract for our recent paper, "Intelligent Digital Agents in the Era of Large Language Models". We’re growing our Research Team and pursuing new projects. If you’re interested in working together, join the conversation on

Publications

An Open-Source Software Toolkit & Benchmark Suite for the Evaluation and Adaptation of Multimodal Action Models

Benchmarking Vision, Language, & Action Models in Procedurally Generated, Open Ended Action Environments

On Orbit Object Transportation With Spacecraft Swarms

Refusal in LLMs is an Affine Function

Intelligent Digital Agents in the Era of Large Language Models