Chuyển tới nội dung chính

AI Inference

FPT AI Inference is a Model-as-a-Service platform that provides pre-trained AI models as API endpoints. Organizations can integrate Large Language Models (LLMs) and Vision Language Models (VLMs) into their applications without developing, training, or managing models themselves.

Key benefits

  • API integration — connect pre-trained models to existing systems without building infrastructure from scratch.
  • Pay-as-you-go — costs are based on input and output tokens. You only pay for what you use.
  • Auto-scaling — the platform scales automatically based on demand on a serverless architecture with 99.9% availability.
  • Model customization — Foundation model customization services are available for organizations with specific requirements.