The Rice Market AI System requires a cross-functional team of 3-4 members with complementary skills,
structured around business capabilities to ensure autonomy, ownership, and end-to-end responsibility.
Teams follow the "you build it, you run it" philosophy, taking ownership from development through production.
I. Core Product Teams (Business Capabilities)
1. Natural Language SQL (NL+SQL) Team
Mission: Enable users to query ERP data using natural language, translating it into secure SQL queries.
Key Responsibilities & Components:
- NL+SQL Agent Service (Vertex AI Gemini 1.5 Pro + LangChain): Process natural language, identify intent, extract entities using LLM Engine
- SQL Generator (LangChain + Vertex AI): Create parameterized SQL queries with proper escaping
- SQL Proxy/Guard (Cloud SQL Auth Proxy): Implement Cloud IAM-based validation, row-level security, allowlist enforcement
- Result Formatter: Format query responses for user consumption (JSON with metadata)
- Cloud SQL PostgreSQL with Read Replicas: Optimize query execution on read replicas, manage data retrieval
- Containerization: Cloud Run containerization with multi-stage builds for NL+SQL Agent Service (Vertex AI Gemini 1.5 Pro + LangChain) with test query processing
- Model Training: Fine-tune Vertex AI Gemini models for rice market domain, achieve >80% accuracy
Core Skills:
NLP with Vertex AI Gemini 1.5 Pro via LangChain
Vertex AI LLMs (Gemini Models)
SQL
Cloud SQL Database Design
Cloud IAM & SQL Security (RBAC)
Cloud Run API Development
2. RAG-Based Document Summarization Team
Mission: Provide intelligent summarization of documents based on Retrieval-Augmented Generation.
Key Responsibilities & Components:
- RAG Orchestrator (Vertex AI Agent Builder + Vector Search): Generate query embeddings (768-dimensional vectors)
- Vertex AI Vector Search: Perform similarity searches using HNSW indexes
- Ranker Module: Re-rank documents using cross-encoder models
- Generator Service (Vertex AI Gemini): Create LLM-based summaries with citations
- Query Embedding (textembedding-gecko@003): Convert text to 768-dimensional vectors, search Vertex AI Vector Search
- Cloud Run Deployment: RAG Orchestrator (Vertex AI Agent Builder + Vector Search) with vector DB connectivity
- Model Training: Fine-tune Vertex AI models leveraging RAG for domain accuracy
Core Skills:
NLP with Vertex AI Gemini 1.5 Pro via LangChain
Vertex AI Vector Search for production embeddings
Cloud SQL with pgvector for hybrid search
Document AI & Information Retrieval
Vertex AI Embedding Models (gecko family)
ML Ranking
3. Time-Series Price Forecasting Team
Mission: Generate accurate, interpretable time-series forecasts for rice prices over specified horizons.
Key Responsibilities & Components:
- Feature Pipeline: Extract features from BigQuery Feature Store with Feast (price history, weather, FX)
- Model Inference Engine: Run LSTM/Prophet ensembles with confidence intervals
- Explainability Module: Generate SHAP values for price driver identification
- Response Formatting: Package predictions with metadata and explanations
- Forecasting Models: Develop 6-month forecasts with confidence intervals
- BigQuery Feature Store with Feast Management: Feature registry, generation, versioning
- MLflow Model Registry on Vertex AI: Version control, champion/challenger strategies
Core Skills:
Time-Series Analysis with PyTorch LSTM models on Vertex AI
LSTM/Prophet
Feature Engineering
Explainable AI
Statistical Modeling
Data Science
II. Supporting Teams (Platform-Oriented)
4. Platform & MLOps with Vertex AI Pipelines, MLflow, and Weights & Biases Team
Mission: Provide robust, scalable, secure microservice platform and MLOps with Vertex AI Pipelines, MLflow, and Weights & Biases infrastructure for all teams.
Key Responsibilities & Components:
- API Gateway (Cloud Endpoints + Cloud Run)/BFF Layer: Authentication, authorization, rate limiting, caching
- Environment Setup: GCP project configuration with IAM and billing
- Repository Structure: Monorepo with proper microservice directories
- Data Pipeline: Ingestion, PII scrubbing, embedding generation
- Vector Store Setup: Deploy and manage Vertex AI Vector Search with Cloud SQL pgvector
- BigQuery Feature Store with Feast: Registry, generation, online/offline storage
- MLflow Model Registry on Vertex AI: Version control, deployment configurations
- ML Pipeline: Automated model retraining workflows
- Container Orchestration: GKE deployment, HPA auto-scaling, Ansible
- CI/CD Pipeline: GitHub Actions for testing and deployment
- Monitoring: Prometheus, Grafana, ELK stack, distributed tracing
- Security: OWASP best practices, security headers implementation
Core Skills:
DevOps with GitHub Actions, Pulumi IaC, and Ansible automation
MLOps with Vertex AI Pipelines, MLflow, and Weights & Biases
Google Cloud Platform (GKE, Vertex AI, BigQuery)/Kubernetes orchestration
Cloud Run containerization with multi-stage builds
CI/CD
Ansible/Terraform
Monitoring Tools
Security
5. Client Application (Frontend) Team
Mission: Design and implement the user interface that interacts with the AI System's capabilities.
Key Responsibilities & Components:
- Frontend Design: React with NextJS framework for SSR/SSG capabilities implementation with chat interface
- API Development: RESTful APIs with OpenAPI specification
- User Experience: Intuitive interface for queries and results
- API Gateway (Cloud Endpoints + Cloud Run) Integration: Consume backend microservices
- Performance Optimization: Client-side performance tuning
Core Skills:
React with NextJS framework for SSR/SSG capabilities
UI/UX Design
API Integration
Web Security
Performance Optimization
Team Principles and Practices
1
Team Size
Each team consists of 3-4 members to foster effective collaboration and reduce communication overhead.
2
Autonomy
Teams operate with limited dependencies, making rapid, context-aware decisions.
3
Ownership
Long-term accountability from conception to production, including on-call responsibilities.
4
Cross-functional
Diverse skills within each team (ML engineers, backend developers, data engineers).
5
Knowledge Sharing
Communities of practice (Chapters and Guilds) to disseminate knowledge without dictating choices.
6
Design Reviews
RFC process for new services with feedback from various teams to catch issues early.
7
Documentation
Living documentation including service overviews, contracts, runbooks, and metadata.
8
Consistency
Microservice chassis for common functionalities ensuring consistency with technical heterogeneity.