Turnkey AI coding assistant that runs entirely on-premises — tab completions, chat, and inline diffs powered by open-source models with zero internet dependency.
Cloud-based AI coding tools are off-limits in secure environments. Your developers watch the productivity revolution from the sidelines.
Developers in air-gapped environments miss out on 30–40% productivity gains that cloud-connected teams enjoy from AI coding tools.
78% of developers in secure environments say they want AI coding assistance but are blocked by data exfiltration policies.
No turnkey AI coding assistant today runs fully on-premises with zero internet dependency. Until M8.
A self-contained AI coding platform that deploys behind your firewall. Open-source models, zero external dependencies, full organizational control.
Nine core capabilities that bring the full AI coding experience to your air-gapped network.
Context-aware code suggestions as you type, powered by a purpose-built fill-in-the-middle model.
Ask questions about your codebase, generate tests, refactor code — all inside your IDE.
Review AI-suggested changes inline with accept/reject controls. No context switching.
Monitor usage, manage seats, configure models, and view analytics across your organization.
Single sign-on via your existing directory service. No separate accounts to manage.
All communication between IDE plugins and the M8 server encrypted with TLS 1.3.
Per-user and org-level rate limits prevent abuse and ensure fair resource allocation.
Prometheus-compatible metrics endpoint. Grafana dashboards included out of the box.
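Per-user rate limiting of the kind described above is commonly implemented as a token bucket. The sketch below is illustrative only — M8's actual limiter, its parameters, and its class names are not published; everything here is an assumption:

```python
import time

class TokenBucket:
    """Minimal token-bucket rate limiter (illustrative, not M8's implementation)."""

    def __init__(self, rate, capacity):
        self.rate = rate          # tokens refilled per second
        self.capacity = capacity  # maximum burst size
        self.tokens = capacity
        self.updated = time.monotonic()

    def allow(self, cost=1.0):
        now = time.monotonic()
        # Refill proportionally to elapsed time, capped at capacity
        self.tokens = min(self.capacity,
                          self.tokens + (now - self.updated) * self.rate)
        self.updated = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False

# One bucket per user: e.g. 5 completion requests/sec, burst of 10
per_user = {"alice": TokenBucket(rate=5, capacity=10)}
print(per_user["alice"].allow())  # → True
```

An org-level limit works the same way with a single shared bucket checked before the per-user one.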
Docker bundle with pre-baked model weights. No internet required at any point during deployment.
Download the M8 Docker bundle with pre-baked model weights on a connected machine. Transfer to a portable drive.
Carry the bundle across the air gap. Load Docker images on your GPU server inside the secure network.
Run one command. M8 configures vLLM, starts the API server, and connects to your LDAP directory. 30 minutes to a working AI coding assistant.
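For a sense of what the one-command bring-up wires together, a compose file might look roughly like this. Service names, image tags, ports, and environment variables here are illustrative assumptions, not M8's actual configuration:

```yaml
services:
  vllm:
    image: m8/vllm-bundled:latest        # hypothetical pre-baked image with model weights
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              count: 1
              capabilities: [gpu]
  api:
    image: m8/api-server:latest          # hypothetical
    environment:
      LDAP_URL: ldaps://ldap.internal.example:636   # your existing directory service
      TLS_MIN_VERSION: "1.3"
    ports:
      - "443:8443"
    depends_on:
      - vllm
```

Nothing in this stack references a registry or endpoint outside your network.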
Both models are Apache 2.0 licensed with no usage restrictions. Developed by organizations in NATO-allied countries.
Purpose-built for inline code completion. Trained on permissively-licensed code. Optimized for sub-200ms suggestion latency.
State-of-the-art open-weight model for code understanding, generation, refactoring, and natural language interaction with large context windows.
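"Fill-in-the-middle" means the completion model sees both the code before and after your cursor, then generates what belongs in between. The sentinel tokens below are illustrative — the exact tokens differ between model families, and M8's completion model is not named here:

```python
def build_fim_prompt(prefix, suffix,
                     pre="<|fim_prefix|>", suf="<|fim_suffix|>",
                     mid="<|fim_middle|>"):
    """Assemble a fill-in-the-middle prompt from the text around the cursor.

    Sentinel tokens vary by model family; the defaults are illustrative.
    The model generates the code that belongs between prefix and suffix.
    """
    return f"{pre}{prefix}{suf}{suffix}{mid}"

# Editor snapshot: the cursor sits between the two fragments
prompt = build_fim_prompt("def add(a, b):\n    return ",
                          "\n\nprint(add(1, 2))")
```

This is why FIM models complete more accurately inside a function body than plain left-to-right models: the suffix constrains what fits.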
M8 scales from a single RTX 4090 supporting a small team to NVIDIA B300 clusters serving hundreds of developers.
| GPU | VRAM | User Capacity | Approx. Cost (purchase or cloud rate) |
|---|---|---|---|
| RTX 4090 | 24 GB | 5–15 users | ~$2K (purchase) |
| L4 | 24 GB | 5–15 users | ~$0.85/hr |
| 2–4x L4 | 48–96 GB | 30–80 users | ~$3–5/hr |
| L40S | 48 GB | 30–60 users | ~$1/hr |
| A100 40GB | 40 GB | 30–50 users | ~$2/hr |
| A100 80 GB (recommended) | 80 GB | 50–100 users | ~$3/hr |
| H100 | 80 GB | 100–200 users | ~$4/hr |
| B300 | 192 GB | 300–500 users | Enterprise |
Every plan includes the air-gap deployment bundle, IDE plugins, and a 2-week free trial.
For small teams getting started with AI coding assistance.
Full AI coding suite with chat and team management.
Maximum performance and compliance for large organizations.
See how M8 compares to cloud-based alternatives on the dimensions that matter most to secure environments.
M8 is the only turnkey AI coding assistant that deploys behind an air gap with open-source models.
| Feature | M8 | GitHub Copilot | Cursor | Tabnine Enterprise | Continue.dev |
|---|---|---|---|---|---|
| Air-gapped deployment | Turnkey | Cloud only | Cloud only | Proprietary | DIY |
| Open-source models | Apache 2.0 | Proprietary | Proprietary | Proprietary | Yes |
| On-premises | Full stack | No | No | Limited | Model only |
| Tab completion | Yes | Yes | Yes | Yes | Yes |
| Chat assistant | Yes | Yes | Yes | No | Yes |
| Inline diffs | Yes | Yes | Yes | No | Limited |
| Admin dashboard | Full | No | No | Yes | No |
| LDAP/AD SSO | Yes | No | No | Yes | No |
| GPU profiles | 8 pre-configured | N/A | N/A | N/A | No |
| Monitoring (Grafana) | Pre-built | N/A | N/A | Yes | No |
| Cost | $20–60/seat/mo | $19–39/seat/mo | $20–40/seat/mo | $39+/seat/mo | Free DIY |
M8 does not just promise data protection — it makes data exfiltration architecturally impossible. No internet connection means no data can leave your network, period.
Everything you need to know about M8.
All models run locally via vLLM on your GPU server. Docker images include pre-baked model weights. Zero external API calls are made at any point — during installation, operation, or updates.
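vLLM exposes an OpenAI-compatible HTTP API, so IDE plugins and scripts can talk to the on-prem server with standard tooling. The sketch below builds (but does not send) such a request; the hostname, port, and model name are placeholders, not M8 defaults:

```python
import json
from urllib import request

def local_chat_request(prompt, base_url="http://m8.internal:8000/v1",
                       model="m8-chat"):
    """Build a chat request for the on-prem vLLM server's OpenAI-compatible
    endpoint. URL and model name are placeholders -- nothing here resolves
    outside your own network."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return request.Request(
        f"{base_url}/chat/completions",
        data=json.dumps(payload).encode(),
        headers={"Content-Type": "application/json"},
    )

req = local_chat_request("Explain this regex: ^\\d{3}-\\d{4}$")
# urllib.request.urlopen(req) would send it -- only inside your perimeter
```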
M8 uses purpose-built open-source models — one optimized for fast tab completion (fill-in-the-middle), another for chat and code generation. Both are Apache 2.0 licensed, sourced from allied-nation research institutions.
30 minutes from bundle transfer to a working AI coding assistant. One command installs and configures everything — vLLM, the API server, LDAP integration, and IDE plugin distribution.
M8 supports 8 GPU profiles ranging from RTX 4090 (5–15 users) to NVIDIA B300 (300–500 users). See the GPU compatibility table above for full details.
Yes. M8 achieves zero data exfiltration by design — no network calls leave your perimeter. It supports deployment requirements for CMMC 2.0, ITAR, HIPAA, and PCI-DSS. TLS 1.3 encryption and LDAP/AD SSO are built in.
GitHub Copilot requires internet connectivity and sends code to Microsoft servers for processing. M8 runs entirely on-premises with open-source models — similar tab completion and chat features, but zero data ever leaves your network.
M8 uses Apache 2.0 licensed models with no usage restrictions. The M8 software itself is BSL 1.1, which automatically converts to Apache 2.0 in 2030. Full source code is available for customer audit.
Yes. We offer a 2-week free trial with the full air-gap bundle. The trial includes defined success criteria, an evaluation framework, and direct engineering support during the evaluation period.