Job Description:
Looking for an AI Developer / Generative AI Engineer with strong experience in Python, Machine Learning, and Generative AI/LLMs. Candidate should have hands-on experience building AI-powered applications, REST APIs/microservices, and deploying models on AWS/Azure/GCP. Experience with LangChain/LlamaIndex, vector databases, and RAG is a plus.
Key Responsibilities:
- Design, develop, and deploy AI/ML and Generative AI solutions for real-world business use cases.
- Build and integrate LLM-based applications (chatbots, copilots, summarization, search, recommendations, etc.).
- Develop RESTful APIs and microservices to serve AI models in production.
- Work with frameworks like LangChain, LlamaIndex, or similar for AI orchestration.
- Fine-tune, evaluate, and optimize models for performance, accuracy, and scalability.
- Implement data pipelines and preprocessing workflows for training and inference.
- Collaborate with frontend and backend teams to integrate AI features into applications.
- Monitor AI systems in production and continuously improve model performance.
- Follow best practices for security, privacy, and responsible AI usage.
Required Skills:
- Strong programming experience in Python.
- Hands-on experience with Machine Learning / Deep Learning.
- Experience with Generative AI and LLMs (OpenAI, Anthropic, Llama, Hugging Face, etc.).
- Experience building and consuming REST APIs / Microservices.
- Knowledge of PyTorch / TensorFlow.
- Experience with cloud platforms (AWS / Azure / GCP).
- Understanding of vector databases (Pinecone, FAISS, Weaviate, Chroma, etc.).
- Experience with SQL/NoSQL databases.
- Strong debugging, problem-solving, and communication skills.