How does pricing work? Am I paying for idle time?

You only pay for actual compute time when your jobs are running. Valkyrie automatically terminates idle clusters, so there are no surprise bills from forgotten instances. Pricing is transparent and usage-based—you're billed per minute of actual execution time across our provider network.

Where does my data go? Can I control data residency?

Your data never leaves the isolated compute environment during job execution. All instances are ephemeral—completely wiped after job completion. For enterprise customers, we support deployment in your own cloud environment for complete data sovereignty and compliance with regional requirements.

What happens if a job fails or gets interrupted?

Valkyrie includes intelligent retry logic and reconciliation. If hardware fails, we automatically migrate your job to available resources. You get clear status reporting (queued, spinning up, running, failed, complete) so you always know what's happening. Failed jobs can be easily restarted without losing progress.

How do I get my results back?

Results can be retrieved directly through the API, automatically uploaded to your S3/GCS/Azure storage, or accessed through our secure download endpoints. You choose the method that fits your workflow—whether that's polling for completion or setting up webhooks for notifications.

Is this just another wrapper around AWS/GCP?

No. Valkyrie intelligently orchestrates across multiple providers (Vast.ai, RunPod, Hetzner, and others) to find the best price-performance for your specific workload. We handle provider failures, optimize routing, and abstract away the complexity of managing multiple cloud relationships.

What about security and compliance? Can I get SOC 2?

Valkyrie is built with enterprise-grade security from day one. We support GDPR compliance, provide detailed audit logs, use encrypted communications (TLS 1.2+), and offer role-based access control. SOC 2 certification is on our roadmap for enterprise customers.

The API for open-weight AI

One REST call.
Any Model.

We got tired of wiring together GPUs, queues, storage, and auth for every AI project. So we made it all just REST calls. Pay only for what you use.

Get Access

devops tax

The Infrastructure Tax That's Stifling Innovation

Your team has brilliant ideas. You have a group of models that you are ready to try and deploy. But you're stuck wrestling with GPU provisioning, environment setup, and cloud complexity instead of building what matters.

Weeks Lost to cluster setup

Hunt for GPUs across providers. Debug environments that break in production. Fight fragile APIs and unclear errors. Your competitive advantage dies waiting for infrastructure.

Bleeding Money on Idle Resources

Pay for clusters that sit unused. Overprovision for traffic spikes. Watch bills spiral while idle resources burn cash and you're still wrestling with basic setup.

Engineering Talent Wasted

Your best developers become reluctant DevOps engineers. Security reviews slow every new tool addition. They're fixing infrastructure instead of solving problems.

From API call to results without the tax

How Valkyrie Works

You bring scripts, models, and data. Valkyrie provisions best‑fit compute (e.g., Vast.ai, RunPod, Hetzner, and others), manages dependencies, reports status clearly, and tears down cleanly when done.

Send job

Call a resilient REST endpoint with your job, artifacts, and params.

Spin up compute

Provision low cost GPU compute.

Execute

Status‑aware endpoints with retries and reconciliation.

Tear down

Environments auto‑terminate and are wiped.

Access Valkyrie

Access Valkyrie Your Way

Run models from your terminal, IDE, or favorite AI coding assistant. No web dashboard required. Valkyrie is the fastest way to mount models, run workflows, and deploy agentic solutions.

Command Line

‍

$ valkyrie inference
	llama-3-70b
$ python run agent.py 
	--dataset ./reviews.csv
$ valkyrie train 
	--model llama-3-70b 
    	--epoch 5

‍

The fastest way to mount models, run workflows, and deploy—all from your terminal.

Cursor

Connect via MCP to run Valkyrie models directly in your editor. Switch between models mid-session without leaving your IDE.

Claude Code

Integrate Valkyrie through MCP to give Claude Code access to any model. Build agentic workflows that leverage multiple LLMs in a single coding session.

Agentic Workflows

Mix and Match Models to Find What Actually Works

Stop guessing which model is best for your use case. Valkyrie lets you mount multiple models and run them in parallel or sequence against your actual data.

Test combinations like:

Run open-weight and commercial models for Agentic Workflows

Llama 3 for reasoning + Devstral for code generation

Multiple fine-tuned versions of the same base model

Commercial models for baseline vs. your fine-tuned open-weight alternative

Parallel Workflows

Send the same prompt to 3 different models, compare outputs, pick the winner

‍

Example: Test GPT-4, Claude, and 
your fine-tuned Llama on 1,000 
customer support tickets. Measure 
accuracy, response time, and cost. 
Deploy the winner.

Sequential Workflows

Chain models together—use one for data extraction, another for analysis, a third for summarization

‍

Example: Model A extracts entities 
from documents → Model B analyzes 
sentiment → Model C generates 
executive summary. Mix and match 
to optimize each stage.

Upload your dataset once, iterate on model combinations until you find the setup that performs best for your specific task.

What Can You Run

One Platform. Many Possibilities.

Use pre‑configured tools out of the box or bring your own. If it's a script, Valkyrie will run it cleanly and at scale. All via REST. We handle GPUs, scaling, and cleanup. All pay-per-use.

LLM
Inference

Llama, Qwen, Mistral, and more

Fine
Tuning

Train on your data, deploy the result

Image
Generation

Stable Diffusion, Flux

STT & TTS

Whisper. Natural voice synthesis

Tabular GPT

Analyze and build in excel

Object Storage

For your files, models, and outputs

Make Agentic AI Workflows Manageable

Access Every AI Model for chat, analysis, workflow agents, custom apps, image generation and more with your preferred foundational model or your own rolled model

Automatic Speech Recognition

Document Questioning

Text Generation

Text Classification

Image-to-Text

Time Series Forecasting

Tabular Regression

Graph ML

Feature Extraction

Zero Shot Classification

Reinforcement Learning

Tabular Classification

Tabular Regression

Text Ranking

Feature Extraction

Token Classification

Summarization

Fill-Mask

Document Q&A

Visual Document Retrieval

Table Question Answering

Sentence Similarity

Object Detection

Image Classification

Visual Question Answering

Text-to-Image

Video Classification

Image Segmentation

Outcomes That Matter

Make infrastructure invisible. Make progress inevitable.

Valkyrie compresses time‑to‑impact while keeping spend predictable and operations auditable.

Innovate Faster

No more days lost configuring GPUs. Valkyrie provisions, runs, and scales compute in minutes so your team can focus on building, not setup.

Cost-Efficient by Design

Automatic optimization across multiple providers. Auto-termination of idle clusters. Pay only for what you use, when you use it.

Enterprise-Grade Reliability

Auto-scaling, GPU fallback, intelligent job scheduling, and robust authentication. Built for production from day one.

Simple Integration

Intuitive REST APIs designed for resilience. Clear status reporting when models load, clusters spin up, or jobs execute. Get access via Claude Code or Cursor

Security & Compliance

Designed for privacy, auditability, and control

Trust is foundational. Valkyrie bakes in controls that protect data, simplify reviews, and meet enterprise expectations—without slowing teams down.

Data Privacy First

Customer data is never stored beyond job lifecycle. Fully abstracted and isolated workloads. Complete control over your proprietary information.

GDPR & Global Compliance

Built-in data handling policies aligned with GDPR and international privacy regulations. Process sensitive data with complete confidence.

Ephemeral & Isolated

Every compute instance is provisioned into an isolated environment. Automatic termination and data wiping when jobs complete.

Role-Based Access Control

Fine-grained permissions ensure only authorized users and teams can access specific resources or endpoints.

Encrypted Communications

All API requests secured with modern encryption protocols (TLS 1.2+). Support for both API keys and JWT tokens.

Audit Trail

Detailed usage logs and wallet-based billing provide transparent records for internal governance and compliance reviews.

More Control, Less Friction

Better than DIY, more portable than single‑cloud

DIY gives control but drains time. Single‑cloud is convenient until capacity or pricing shifts. Valkyrie is a unified execution fabric that's cost‑smart and portable.

	DIY Orchestration	Single-Cloud Runner	Valkyrie
Provisioning & Scale	Slow, manual	Easy until capacity/pricing shifts	Dynamic across providers
Cost Control	Idle burn common	Vendor-dependent	Auto-optimize + scale-to-zero
Portability	High effort	Low	High (open-weight friendly)
Reliability	Brittle under growth	Varies by region/stock	Retries, reconciliation, failover
Security & Audits	Build it yourself	Mixed	Ephemeral, RBAC, audit logs
Dev Speed	Weeks to stable	Days	Minutes to first output

Early Access for teams ready to build

How To Get Started

We're partnering with select teams to shape the roadmap in real environments. Access is commitment‑light and outcome‑focused.

Builder Teams

For engineering‑led POCs and first production jobs

Core platform access

Pre-configured tools

Email support, usage & cost visibility

Connect via Claude Code or Cursor

Scale & Enterprise

For regulated, high‑throughput, or multi‑team workloads

RBAC, audit logs, wallet‑based billing

SSO/SAML (roadmap), VPC (where applicable)

Priority support, roadmap influence

All Builder features

Frequently Asked Questions

What types of models and workloads can I run on Valkyrie?

Valkyrie can execute any script or model that runs in a containerized environment. This includes XGBoost, scikit-learn, PyTorch, TensorFlow models, custom Python/R scripts, data processing pipelines, and even non-ML workloads like simulations or batch analytics. If it runs on Linux, it runs on Valkyrie. Valkyrie automatically selects optimal hardware for your workload, including GPU-accelerated instances when needed. For custom environment requirements, you can specify dependencies in your job submission. Enterprise customers can work with us to pre-configure specialized environments for recurring use cases.

How quickly can I get started?

If you have a working script and model, you can submit your first job within minutes of getting API access. No lengthy onboarding, no infrastructure setup, no DevOps expertise required. Our early access program includes direct support to help you optimize your first workflows. Valkyrie's REST API integrates seamlessly with existing tools like Airflow, Kubeflow, MLflow, or custom orchestration systems. You can treat Valkyrie as a compute backend that plugs into your current workflow without disrupting your established processes.