My Journey of Building a Terraform AI Agent — Automating Cloud Infrastructure with AI

31 October, 2025

By Anoop Kumar, Lead DevOps Consultant, Rackspace Technology

The vision: Why I built this AI agent

As a DevOps Engineer passionate about AI, I always seek ways to optimize and automate cloud infrastructure. Managing Terraform code manually was timeconsuming, error-prone, and required constant validation. I wanted to build something that could help my team and customers deploy resources faster, smarter and with fewer mistakes.

I shared my idea with my manager who encouraged me to explore this further. I also took feedback from my colleagues which helped refine the approach. This led to the creation of the Terraform AI Agent, a GenAI tool that generates, validates, and deploys Terraform code in Azure automatically.

The journey: From idea to execution

Two months ago, I started researching how AI could automate Terraform. Initially, my goal was simple: Generate Terraform code automatically and save it into .tf files.

The Challenge: AI Hallucination & Duplicate Code

However, I soon faced a major issue, the AI was generating the same code repeatedly for similar requests, leading to duplicate resources in Azure. This could cause conflicts and unnecessary costs.

The Solution: Smart Code Generation & Validation

To fix this, I implemented a function that checks existing Terraform infrastructure before generating new code. This ensures:

✔ No duplicate resources are created.
✔ Only unique and necessary code is generated.
✔ Custom modules & templates can be used to meet compliance requirements.
✔ AI hallucinations (incorrect code generation) are eliminated.
After solving the code generation problem, I focused on validating and deploying the Terraform code seamlessly.

How the Terraform AI agent works?

Step 1: Enter infrastructure request

Instead of manually writing Terraform code, users enter their infrastructure request in a simple prompt.

Example: “Create a resource group named aiagent-terraform-rg in uksouth.”
The AI instantly generates Terraform code while checking for existing
infrastructure to prevent duplicates.
Supports custom modules & templates to ensure compliance.

Step 2: Validate and auto-fix code

Before deployment, the AI performs multiple checks using function:
✔ Runs terraform fmt to format the code.
✔ Uses terraform validate to ensure correctness.
✔ Applies TFLint to detect security misconfigurations.
✔ Auto-fixes errors to maintain best practices.

Step 3: Deploy via GitHub Actions

With a single click on “Save & Deploy” function, the Terraform AI Agent: Pushes validated Terraform code to GitHub and Triggers GitHub Actions, which run:

terraform init
terraform validate
terraform plan
terraform apply
Deploys infrastructure in Azure and stores the Terraform state in Blob Storage.

Terraform AI agent architecture and workflow

GitHub repository: https://github.com/anoopkum/terraform-automation

Step-by-step implementation

The heart of our solution is the AI agent that converts natural language into Terraform code. Looking at the repository, we can see how this is implemented in aiagent.py. Initially I use OpenAI’s GPT-4o model but later changed to O3-mini which is good in reasoning/code to convert natural language prompts into structured Terraform code.

User interface workflow

UI has three main action buttons that guide users through the infrastructure creation process:

1. Generate Terraform code button

When users enter their infrastructure requirements as a natural-language prompt, they can click the “Generate Terraform Code” button. This triggers the AI agent to:

Process the natural language description
Convert it into properly structured Terraform code
Display the generated code for review

The generated code appears in a code editor interface where users can review it and make any desired adjustments.

2. Validate Terraform code button

After generating the code, users can click the “Validate Terraform Code” button to initiate the validation process.

This button triggers:

Code formatting with terraform fmt
Syntax checking with terraform validate
Best practices verification with tflint

Green checkmarks indicate successful validations, while red alerts highlight areas needing attention.

3. Save & deploy button:

Once the code passes validation, users can click the “Save & Deploy” button to initiate the deployment process. This button:

Saves the code into terraform file (main.tf)
Creates a new pull request in the GitHub repository
Triggers the GitHub Actions workflow
Automates the approval and merge process
Deploys the infrastructure to Azure

The interface provides real-time status updates during the deployment process, showing the progress from plan generation to resource creation.

Terraform state management

A critical aspect of Terraform infrastructure management is the state file. In our architecture, the Terraform state is stored securely in Azure Blob Storage. The GitHub Actions workflow initializes Terraform with backend configuration that points to the Azure Storage Account and container. The storage account access key is securely fetched from Azure Key Vault during deployment.

Key vault during deployment

Frontend implementation

The frontend interface is implemented using a modern React application. Let’s look at how the core user interface component is structured:

UI implementation (streamlit_app.py)

This Streamlit app implements the step-by-step workflow with the three main buttons and manages the entire process from prompt input to deployment.

View the live demo of Terraform AI agent here
Business impact

Reduces deployment time from hours to minutes
Eliminates human errors with auto-validation and fixes
Ensures compliance with pre-configured modules and security checks
Improves efficiency for DevOps and cloud engineers

Future enhancements

Currently, the Terraform AI agent supports resource creation. The next phase will include:

Modify and delete capabilities for the full infrastructure lifecycle management
Support for additional cloud platforms beyond Azure

Tech stack and AI models

Azure OpenAI (GPT-4o/O3-mini) for code generation.
Terraform for infrastructure as code.
GitHub Actions for CI/CD automation.
Python (FastAPI, Streamlit) for AI agent logic & UI.
Github as a repository.
Azure Cloud.

Check out the terraform-automation repository to get your hands dirty and learn to create the AI agent.

If you need any help or have questions? Feel free to reach out, I’m happy to help!
If you found this helpful, give it a thumbs up, support, and share it with others.

Blog Links

Learn more

My Journey of Building a Terraform AI Agent — Automating Cloud Infrastructure with AI

Recent Posts

Blog Links

Learn more about our cloud automation and AI services