
GPU Cloud

Virtual NVIDIA GPUs for AI, machine learning, and rendering — from Tesla T4 to H200, granularly configurable, 100% data sovereign from Munich.

NVIDIA: T4 · A10 · A100 · H200
vGPU: Flexible GPU Shares
CUDA: cuDNN · TensorRT

Configure GPU Server

Select your GPU model and configure all resources individually.

Configuration

GPU Model

NVIDIA vGPU · dedicated VRAM

vGPU Profile

Dedicated VRAM


Dedicated vCores

Intel® Xeon® · dedicated

4 vCores · € 66,00 (€ 16,50 per vCore)
Scalable: 4 / 32 / 64 / 96 / 128 vCores

RAM

ECC · exclusively assigned

16 GB · € 71,20 (€ 4,45 per GB)
Scalable: 16 GB / 128 GB / 256 GB / 384 GB / 512 GB

NVMe Storage

3N Redundancy · Ceph

100 GB · € 35,00 (€ 0,17 per GB)
Scalable: 100 GB / 4 TB / 8 TB / 12 TB / 16 TB

Service Level Agreement

Availability & Support

Monthly net total · excl. VAT

GPU Pricing

All vGPU profiles at a glance — combinable with freely configurable compute resources.

GPU / Profile            | VRAM         | FP32        | Price/Month
Tesla T4 — T4-4Q         | 4 GB GDDR6   | 2,0 TFLOPS  | € 69,00
Tesla T4 — T4-8Q         | 8 GB GDDR6   | 4,1 TFLOPS  | € 129,00
Tesla T4 — T4-16Q        | 16 GB GDDR6  | 8,1 TFLOPS  | € 249,00
NVIDIA A10 — A10-4Q      | 4 GB GDDR6   | 5,2 TFLOPS  | € 99,00
NVIDIA A10 — A10-8Q      | 8 GB GDDR6   | 10,4 TFLOPS | € 189,00
NVIDIA A10 — A10-12Q     | 12 GB GDDR6  | 15,6 TFLOPS | € 279,00
NVIDIA A10 — A10-24Q     | 24 GB GDDR6  | 31,2 TFLOPS | € 549,00
NVIDIA A100 — 1g.10gb    | 10 GB HBM2e  | 2,8 TFLOPS  | € 179,00
NVIDIA A100 — 2g.20gb    | 20 GB HBM2e  | 5,6 TFLOPS  | € 349,00
NVIDIA A100 — 3g.40gb    | 40 GB HBM2e  | 8,4 TFLOPS  | € 529,00
NVIDIA A100 — 7g.80gb    | 80 GB HBM2e  | 19,5 TFLOPS | € 999,00
NVIDIA H200 — 1g.20gb    | 20 GB HBM3e  | 9,6 TFLOPS  | € 349,00
NVIDIA H200 — 2g.40gb    | 40 GB HBM3e  | 19,1 TFLOPS | € 679,00
NVIDIA H200 — 3g.70gb    | 70 GB HBM3e  | 28,7 TFLOPS | € 999,00
NVIDIA H200 — 7g.141gb   | 141 GB HBM3e | 67,0 TFLOPS | € 1.899,00

All prices excl. VAT. vCores, RAM, and storage are configured separately. Multi-GPU available on request.
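As an illustration, the table above can be queried programmatically, for example to find the cheapest profile that satisfies a VRAM requirement. A minimal Python sketch with names and prices copied from the table; the helper function itself is hypothetical:

```python
# Profiles copied from the GPU Pricing table above:
# (name, dedicated VRAM in GB, monthly net price in EUR).
PROFILES = [
    ("Tesla T4 — T4-4Q",       4,    69.00),
    ("Tesla T4 — T4-8Q",       8,   129.00),
    ("Tesla T4 — T4-16Q",      16,  249.00),
    ("NVIDIA A10 — A10-4Q",    4,    99.00),
    ("NVIDIA A10 — A10-8Q",    8,   189.00),
    ("NVIDIA A10 — A10-12Q",   12,  279.00),
    ("NVIDIA A10 — A10-24Q",   24,  549.00),
    ("NVIDIA A100 — 1g.10gb",  10,  179.00),
    ("NVIDIA A100 — 2g.20gb",  20,  349.00),
    ("NVIDIA A100 — 3g.40gb",  40,  529.00),
    ("NVIDIA A100 — 7g.80gb",  80,  999.00),
    ("NVIDIA H200 — 1g.20gb",  20,  349.00),
    ("NVIDIA H200 — 2g.40gb",  40,  679.00),
    ("NVIDIA H200 — 3g.70gb",  70,  999.00),
    ("NVIDIA H200 — 7g.141gb", 141, 1899.00),
]

def cheapest_profile(min_vram_gb: int):
    """Return (name, vram_gb, eur_per_month) of the cheapest profile
    offering at least min_vram_gb of dedicated VRAM."""
    candidates = [p for p in PROFILES if p[1] >= min_vram_gb]
    if not candidates:
        raise ValueError(f"No profile offers {min_vram_gb} GB VRAM")
    return min(candidates, key=lambda p: p[2])
```

For instance, a 16 GB requirement resolves to the Tesla T4 — T4-16Q at € 249,00/month, while anything above 80 GB leaves only the full H200 7g.141gb.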

GPU Power

Dedicated GPU Power — Full Control

Book exactly the GPU power you need — from small vGPU slices for inference to the full GPU for training. All resources are dedicated, no shared instances.

NVIDIA Tesla T4, A10, A100 & H200
Granular vGPU profiles (4–141 GB VRAM)
CUDA, cuDNN & TensorRT Support
Dedicated vCores & DDR5 ECC RAM
No vendor lock-in (Open Source: KVM)
No US Cloud Act — owner-operated German GmbH
Operated in ISO 27001 certified data centers
GPU Cloud Infrastructure

Additional Services

Flexible GPU Scaling

Start with a single vGPU and scale to multi-GPU setups as needed. No long-term hardware commitment.

CUDA Ecosystem

Full support for CUDA, cuDNN, and TensorRT. Compatible with PyTorch, TensorFlow, JAX, and all major ML frameworks.

Data Sovereignty

Your training data and models remain on German infrastructure. No US Cloud Act — owner-operated GmbH.

Direct Connect

Connect GPU instances seamlessly with bare metal servers and colocation hardware — ideal for hybrid AI pipelines.

INGATE Premium Support

Support via email and phone, free 24x7 emergency hotline, personal point of contact, and highly qualified on-site staff.

Container-Ready

Pre-configured NVIDIA container images with CUDA toolkit. Docker & Kubernetes-ready for seamless CI/CD pipelines.

Technical Highlights

State-of-the-art infrastructure in our data centers for your business-critical applications.

Redundant Power Supply

Dual-path A/B power supply down to the rack. Dedicated transformers, UPS, and backup generators.

High-Efficiency Cooling

PUE < 1.20 through free cooling and cold-aisle containment. Optimized for high-density deployments of up to 20 kW per rack.

Fire Protection

VESDA early detection and damage-free gas extinguishing system.

High-Speed Backbone

Redundant high-performance backbone with multiple 100 Gbit/s links. Direct peering at DE-CIX and MuCon-X for minimal latency.

Physical Security

Security level SK4. Biometric access control and comprehensive video surveillance.

Sustainability

Carbon-neutral operations with 100% green energy. Certified green electricity and waste heat recovery.

Certified Data Centers

Our primary data center EMC Home of Data in Munich holds the following certifications. All additional data centers are at least ISO 27001 certified and powered by 100% renewable energy. Select locations additionally hold SOC 1, SOC 2, and PCI-DSS certifications.

ISO 27001
Information Security
ISO 9001
Quality Management
ISO 50001
Energy Management
DIN EN 50600
DC Availability
CSR 26001
Corporate Responsibility
TÜV Süd
100% Green Energy

Frequently Asked Questions

Answers to the most important questions about GPU Cloud.

What is the difference between Cloud GPU and GPU Server?
Cloud GPU offers virtual GPU instances (vGPU) that can be flexibly scaled — ideal for variable workloads. GPU Servers are dedicated physical servers with exclusively assigned GPUs — optimal for continuous training with maximum performance.
Which GPU models are available?
We offer four NVIDIA GPU classes: Tesla T4 (16 GB GDDR6) as a cost-efficient entry-level GPU, A10 (24 GB GDDR6) as an all-rounder, A100 (80 GB HBM2e) for demanding AI workloads, and H200 (141 GB HBM3e) for maximum AI performance. Each GPU can be divided into different profiles — from small slices for inference to the full GPU for training.
What is the difference between T4, A10, A100, and H200?
The Tesla T4 (Turing, 8.1 TFLOPS FP32) is ideal for cost-efficient inference, VDI, and light ML workloads. The A10 (Ampere, 31.2 TFLOPS FP32) is an all-rounder for ML training, 3D rendering, and virtual desktops. The A100 (Ampere, 80 GB HBM2e, 312 TFLOPS FP16 Tensor) offers MIG isolation for demanding AI workloads. The H200 (Hopper, 141 GB HBM3e, 989 TFLOPS FP16 Tensor) delivers maximum performance for LLM training and large foundation models.
What does vGPU mean?
vGPU (Virtual GPU) allows a physical GPU to be divided into multiple virtual instances. Each vGPU instance receives dedicated GPU resources and VRAM. This way, you can book exactly the GPU power you need — without having to rent an entire GPU.
Which frameworks are supported?
Full support for CUDA, cuDNN, and TensorRT. Compatible with all major ML frameworks such as PyTorch, TensorFlow, JAX, and ONNX Runtime. We provide pre-configured container images.
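As a minimal sketch, a workload could confirm GPU visibility from inside an instance like this, assuming a CUDA-enabled PyTorch build; the import guard keeps the snippet runnable even on machines without a GPU or without PyTorch:

```python
def pick_device() -> str:
    """Prefer an NVIDIA GPU via CUDA when PyTorch can see one, else fall back to CPU."""
    try:
        import torch  # assumes a CUDA-enabled PyTorch build on the instance
        if torch.cuda.is_available():
            # On a GPU instance this reports the vGPU, e.g. an A100 or H200 profile.
            return f"cuda ({torch.cuda.get_device_name(0)})"
    except ImportError:
        pass
    return "cpu"

print(f"running on: {pick_device()}")
```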
Can I combine Cloud GPU with Bare Metal?
Yes, via Direct Connect you can seamlessly connect cloud GPU instances with your bare metal servers and colocation hardware — ideal for hybrid AI pipelines.
How is billing handled?
All resources are billed monthly. You configure vGPU, vCores, RAM, and storage individually and pay only for what you use. No minimum contract term.
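A rough monthly estimate can be sketched from the unit prices shown in the configurator (€ 16,50 per vCore, € 4,45 per GB RAM, € 0,17 per GB NVMe, all net). The helper function is illustrative; note that the configurator lists € 35,00 for the 100 GB storage base, which suggests a minimum charge that this simple per-GB formula ignores:

```python
# Unit prices from the configurator above (monthly, net, excl. VAT).
VCORE_EUR = 16.50  # per dedicated vCore
RAM_EUR = 4.45     # per GB of ECC RAM
NVME_EUR = 0.17    # per GB of NVMe storage

def monthly_net_eur(profile_eur: float, vcores: int, ram_gb: int, nvme_gb: int) -> float:
    """Illustrative total: vGPU profile price plus the separately
    configured compute resources, all billed per month."""
    return round(profile_eur + vcores * VCORE_EUR + ram_gb * RAM_EUR + nvme_gb * NVME_EUR, 2)

# Example: A100 2g.20gb (€ 349,00) with 4 vCores, 16 GB RAM, 1 TB NVMe:
# 349,00 + 66,00 + 71,20 + 170,00 = € 656,20 net per month.
```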
What are egress costs?
Egress costs are fees that cloud providers charge for outgoing data traffic — i.e., for data leaving the data center toward the internet or other networks. Every API response, every download, every video stream, and every backup replication generates egress traffic. Most hyperscalers charge these costs per GB, which can quickly become a significant and hard-to-calculate cost factor. Especially with data-intensive applications, these fees add up rapidly.
Does INGATE charge egress costs?
No. At INGATE, outgoing data traffic is already included in the price — no per-GB fees, no hidden surcharges. This makes your costs fully predictable and transparent. Especially with data-intensive applications like CDN, streaming, large APIs, or backup replication, this results in a massive cost advantage over the major hyperscalers.
How high are egress costs at hyperscalers?
AWS charges approximately $0.09/GB (first 10 TB), Azure approximately $0.087/GB, and Google Cloud approximately $0.12/GB. A company transferring 10 TB per month pays approximately $900-1,200/month for outgoing traffic alone. At 100 TB, it is already $8,000-9,000+/month. These costs are difficult to predict as they depend on user behavior, API call volumes, and traffic patterns — complicating TCO calculations and frequently leading to unexpectedly high bills ("bill shock"). At INGATE, traffic is included, making total costs fully calculable from day one.
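The figures above follow from simple per-GB arithmetic. The sketch below assumes decimal units (1 TB = 1 000 GB), which is roughly how hyperscalers meter egress, and uses the list rates quoted in the answer:

```python
# Approximate egress list prices per GB (USD), as quoted above.
EGRESS_USD_PER_GB = {
    "AWS": 0.09,           # first 10 TB tier
    "Azure": 0.087,
    "Google Cloud": 0.12,
    "INGATE": 0.0,         # outgoing traffic included in the price
}

def egress_cost_usd(provider: str, tb_per_month: float) -> float:
    """Monthly egress cost, assuming 1 TB = 1,000 GB (decimal billing)."""
    return round(EGRESS_USD_PER_GB[provider] * tb_per_month * 1_000, 2)

# At 10 TB/month: AWS is about $900 and Google Cloud about $1,200,
# matching the range quoted above; INGATE stays at $0.
```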

Technology Partners & Memberships

Dell PartnerDirect
Equinix
EMC Home of Data
Juniper Networks
LiveConfig
Microsoft Cloud Solution Provider
Microsoft SPLA Partner
RIPE NCC Member