Job Description
Job Description
We are seeking a skilled with strong Python expertise to join our team. This role focuses on developing and integrating AI-powered features, with an emphasis on embedding-based systems, semantic search, and large language models (LLMs). The primary project involves building an Outlook bot, with additional opportunities to work on AI products like SharePoint widgets. You will collaborate with the DLA team to integrate self-hosted AI models and ensure solutions align with business goals.
- Design, develop, and maintain embedding-based systems and large language model (LLM) integrations for AI-driven products.
- Develop and optimize search and ranking algorithms, including semantic, hybrid/fusion, and metadata-based approaches.
- Build and enhance AI-powered features such as semantic search, intelligent summarization, drafting assistance, context-aware recommendations, and other advanced language or data-driven capabilities.
- Architect, implement, and optimize APIs and backend services (using FastAPI or similar frameworks) to expose AI capabilities to a variety of client applications and platforms. including VSTO add-ins for Microsoft Outlook.
- Integrate self-hosted and cloud-based AI models into production environments, collaborating closely with cross-functional teams and stakeholders.
- Propose and implement scalable, efficient, and maintainable solutions for AI product development, including data preprocessing, feature engineering, and model serving.
- Ensure robust data security, privacy, and compliance in all AI-powered solutions, adhering to organizational and regulatory standards.
- Apply generative AI techniques, including prompt engineering, fine- tuning LLMs, and building applications for summarization, Q&A, and chatbots.
- Write clear, maintainable code and comprehensive documentation, and collaborate with the DLA team on integrating and supporting self- hosted AI models.
- Monitor, evaluate, and continuously improve the performance and reliability of AI features in production.
- / : Proficient in natural language processing and large language models (e.g., OpenAI, HuggingFace Transformers), including experience with vector embeddings (text, image, etc.) and retrieval- augmented generation (RAG).
- ::Skills in prompt engineering, fine-tuning LLMs, and building generative applications (summarization, Q&A, chatbots).
- & : Skilled in building and maintaining Python APIs (preferably FastAPI), with experience in integrating AI models into production services.
- :Experience with data preprocessing, feature extraction, and efficient storage/retrieval of embeddings and metadata (e.g., JSON, databases).
- & : Knowledge of semantic search, hybrid/fusion ranking, and metadata-based boosting.
- I:Ability to integrate AI models and services into production systems, ensuring scalability, reliability, and low-latency performance.
- & : Ability to write clear, maintainable code and collaborate effectively with cross-functional teams.
- & : Familiarity with containerization (Docker), CI/CD pipelines, and deploying AI services in cloud or on-premises environments.
- & :Knowledge of data privacy, secure model deployment, and responsible AI practices.
- Experience with C#.
- Familiarity with cloud platforms, particularly Azure, for potential expansion.
The current focus is on developing an Outlook bot using VSTO and Python APIs, but the role will involve working on multiple types of applications and AI-driven solutions.