hosted.ai 2.4: new GPU monetization options, smoother neocloud operations

#BuildingInPublic – the platform story so far

What’s new in hosted·ai v2.4:

1. Sell bare metal GPU instances


2. Sell GPUaaS with user-selected and VIP GPU scheduling


The hosted·ai platform has the most flexible GPU scheduling engine on the market. Version 2.4 extends it with two new scheduling options that you can configure for your GPUaaS products:

Dynamic / user-selected GPU scheduling: this lets you create GPUaaS products in which users choose the minimum percentage of GPU resources they will receive from a shared GPU pool. The hosted·ai scheduler guarantees that percentage for their workloads, and the user is billed accordingly.
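As a rough illustration of what a guaranteed minimum share implies, the sketch below admits a reservation only while the guarantees on a pool still add up to 100% or less. It is a simplified model, not the hosted·ai scheduler: the `GpuPool` class, its method names, and the tenant names are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class GpuPool:
    """Hypothetical model of a shared GPU pool with guaranteed minimum shares."""

    # tenant name -> guaranteed minimum share, in percent of the pool
    reserved: dict = field(default_factory=dict)

    def free_percent(self) -> int:
        """Share of the pool that is not yet promised to any tenant."""
        return 100 - sum(self.reserved.values())

    def reserve(self, tenant: str, min_percent: int) -> bool:
        """Admit the request only if the guarantee can still be honoured."""
        if not 1 <= min_percent <= 100:
            raise ValueError("min_percent must be between 1 and 100")
        current = self.reserved.get(tenant, 0)
        if min_percent - current > self.free_percent():
            return False  # admitting this would over-commit the pool
        self.reserved[tenant] = min_percent
        return True


pool = GpuPool()
print(pool.reserve("tenant-a", 50))  # True
print(pool.reserve("tenant-b", 40))  # True
print(pool.reserve("tenant-c", 20))  # False: only 10% is still unreserved
```

Rejecting a request up front, rather than shrinking existing shares, is one simple way to keep every admitted guarantee valid; a real scheduler may handle over-subscription differently.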

VIP priority scheduling: this lets you create GPUaaS products with prioritized workload scheduling. When multiple tenants access a pool simultaneously, VIP user workloads take priority, and the VIP user is billed accordingly.
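One common way to express this kind of prioritization is a priority queue in which VIP workloads are dequeued before standard ones and submission order is preserved within each tier. The sketch below shows that idea only; the `PoolQueue` class and the tier constants are hypothetical and do not describe the hosted·ai implementation.

```python
import heapq
import itertools

VIP, STANDARD = 0, 1  # hypothetical tiers; the lower value is dispatched first


class PoolQueue:
    """Hypothetical dispatch queue for workloads competing for one GPU pool."""

    def __init__(self):
        self._heap = []
        self._seq = itertools.count()  # tie-breaker: keeps FIFO order per tier

    def submit(self, workload, tier=STANDARD):
        heapq.heappush(self._heap, (tier, next(self._seq), workload))

    def next_workload(self):
        """Return the next workload to run, or None if the queue is empty."""
        if not self._heap:
            return None
        return heapq.heappop(self._heap)[2]


queue = PoolQueue()
queue.submit("train-a")
queue.submit("infer-vip", tier=VIP)
queue.submit("train-b")
print([queue.next_workload() for _ in range(3)])
# ['infer-vip', 'train-a', 'train-b']
```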

3. RootFS persistence for GPUaaS pods


The root file system of GPUaaS pods can now be made persistent across reboots. Persistence is implemented with a host-path storage plugin, so there is no separate remote volume, no periodic data sync, and zero performance overhead.

RootFS persistence is enabled by default on new instances:

  • Ansible service execution runs on first boot only, not on subsequent instance reboots
  • A new ‘factory reset’ feature with data wipe fully erases persistent storage when required
  • Storage quotas are enforced with sysbox (ENOSPC at 96% usage) to prevent pods from crashing when they run out of storage
  • df and reboot wrappers give pods full VM-like behaviour, with df showing the actual storage usage inside an instance
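The quota behaviour can be pictured as a threshold check that fails a write with ENOSPC before the pod's storage is completely full. The sketch below is a model of that rule only and does not use any sysbox API: the `check_write` function is hypothetical, and whether the real check triggers at exactly 96% or just above it is an assumption.

```python
import errno
import os

ENOSPC_PERCENT = 96  # threshold from the release notes; exact comparison is assumed


def check_write(used_bytes: int, write_bytes: int, quota_bytes: int) -> int:
    """Return the new usage, or raise ENOSPC if the write would pass the threshold."""
    limit = quota_bytes * ENOSPC_PERCENT // 100  # integer maths avoids float rounding
    new_usage = used_bytes + write_bytes
    if new_usage > limit:
        raise OSError(errno.ENOSPC, os.strerror(errno.ENOSPC))
    return new_usage


try:
    check_write(used_bytes=95, write_bytes=2, quota_bytes=100)
except OSError as exc:
    print(exc.errno == errno.ENOSPC)  # True
```

Failing early with a standard error code lets applications handle a full disk gracefully while some headroom remains.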

4. High Availability for KVM clusters


hosted·ai v2.4 introduces automatic primary/secondary failover for KVM cluster panel nodes, using etcd for distributed leader election. A periodic HA agent aligns controller services and SQL database replication to the elected leader without manual intervention. Auto-restart can be enabled or disabled for each VM individually.

High Availability is enabled via a toggle in the hosted·ai cluster management panel. It enforces a 3-node minimum, and supports full disable/revert.
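The 3-node minimum is consistent with how etcd reaches consensus: a cluster needs a majority of its members (a quorum) to elect a leader, so two nodes cannot survive the loss of either one. The short calculation below shows the arithmetic; the function names are illustrative only.

```python
def quorum(members: int) -> int:
    """Smallest majority of a cluster with the given number of members."""
    return members // 2 + 1


def failures_tolerated(members: int) -> int:
    """How many members can fail while a quorum is still reachable."""
    return members - quorum(members)


for n in (1, 2, 3, 5):
    print(n, quorum(n), failures_tolerated(n))
# 1 1 0
# 2 2 0
# 3 2 1
# 5 3 2
```

Three is the smallest cluster size that tolerates a single node failure.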

5. Prometheus metrics framework for KVM


In hosted·ai v2.4, we have replaced the legacy RRD file-based stats system with a Prometheus + libvirt exporter architecture.

It provides per-VM metrics (vCPU, memory, disk I/O, network I/O) and cluster-wide node exporter metrics, with real-time dashboards, Alertmanager webhook integration, and 10-minute batch collection. As nodes are added or removed, the Prometheus configuration is regenerated automatically, with a 10-second target sync.
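One way to keep scrape targets in step with cluster membership is Prometheus file-based service discovery, where a JSON file of target groups is rewritten whenever the node list changes. The sketch below illustrates that approach under stated assumptions: the release notes do not say which mechanism hosted·ai uses, the function names and host names are hypothetical, and the libvirt exporter port is assumed (9100 is the node exporter default).

```python
import json
import os
import tempfile

NODE_EXPORTER_PORT = 9100     # node exporter default port
LIBVIRT_EXPORTER_PORT = 9177  # assumed port for the libvirt exporter


def build_target_groups(nodes):
    """Build file_sd target groups (a list of {"targets", "labels"} objects)."""
    nodes = sorted(nodes)  # stable ordering keeps regenerated files diff-friendly
    return [
        {"targets": [f"{n}:{NODE_EXPORTER_PORT}" for n in nodes],
         "labels": {"job": "node"}},
        {"targets": [f"{n}:{LIBVIRT_EXPORTER_PORT}" for n in nodes],
         "labels": {"job": "libvirt"}},
    ]


def write_targets(path, nodes):
    """Write the target file atomically so a partial file is never read."""
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp = tempfile.mkstemp(dir=directory, suffix=".tmp")
    with os.fdopen(fd, "w") as fh:
        json.dump(build_target_groups(nodes), fh, indent=2)
    os.chmod(tmp, 0o644)   # mkstemp creates the file as 0600
    os.replace(tmp, path)  # atomic rename on the same filesystem


print(json.dumps(build_target_groups(["kvm-node-2", "kvm-node-1"]), indent=2))
```

A periodic job could call `write_targets` with the current node list; Prometheus picks up changes to files referenced by `file_sd_configs` without a restart.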

Next steps


  • To upgrade from previous versions to hosted·ai v2.4, please contact your account manager or our customer success team.
