Ops Insights (OPSI) is an Oracle Cloud Infrastructure(OCI) native service that helps customers to get insights about capacity, performance and diagnostics aspects of their cloud and on-premises resources like Database and Compute. Under the cover, this service’s analytical engine is powered by large volume of data ingested from various sources followed by long term data storage and running ML algorithms to piece together various insights and recommendations. All these are done at cloud scale to serve for 1000s of customers across all regions of OCI.
OPSI is looking for a seasoned engineer to join its team to help envision the future of the service, and then build and ship to production. You will join a team comprised of self-motivated full stack engineers and leaders who believe in completely owning all aspects of the service like: design, development, automation testing, SRE, devops and production on-call. If you are self-motivated go getter type of engineer who has passion to use technology to solve innovative problems and automate everything on the way, you will find yourself at home here!
This particular role will be pivotal in build, optimize and productize Generative AI (GenAI) features in OPSI using frameworks like Llama, Retrieval-Augmented Generation (RAG), and Gen AI agents. The ideal candidate will have hands-on experience designing intelligent AI agents capable of reasoning, decision-making, and task automation using LangGraph or equivalent agent frameworks. Proficiency in Python, LangGraph/PyTorch, along with expertise in prompt engineering, integration with external tools, vector databases, building RAG pipelines, and knowledge graph integration, is essential. A deep understanding of building and tuning multi-agent systems is highly desirable. The role requires a creative problem-solver who can innovate and collaborate in a dynamic environment, with a strong ability to communicate complex AI concepts to diverse stakeholders. A degree in computer science, AI, or a related field, along with a proven track record of delivering scalable production GenAI solutions, is preferred.
https://docs.oracle.com/en-us/iaas/operations-insights/doc/operations-insights.html
Skills/experience we are looking for includes but not limited to:
Large scale fault tolerant distributed systems, Oracle database, SQL, Kafka, Java, Javascript, ML, REST APIs, SRE, devops, scripting, terraform, python, docker. Cloud experience would be desirable, but not required.
Preferred Qualifications
- 4+ years’ experience as a Software Developer/Engineer within an Enterprise Product Development or Cloud development team
- 4+ years’ experience with a high-level programming language (Java preferred)
- 4+ years’ experience and expertise working with cloud services/distributed systems
- Must have hands on experience with GenAI, LLM models, RAG pipelines using vector DB, agent implementation
- Must possess excellent debugging skills
- Bachelor’s degree in Computer Science, Engineering or related field
- Technical leadership skills
Additional Competencies
- A can-do attitude with a high interest in learning new technologies and how to function in a multi-dimensional team environment.
- Excellent verbal and written communication skills
- Experienced with multiple software development methodologies, with an emphasis and passion for Agile
- Ability to communicate and collaborate effectively with all levels of the organization and across multiple technical and non-technical groups
- Comfortable and effective in internal and external-facing scenarios
- A can-do attitude with a willingness to take on challenging tasks and the ability to multitask effectively across multiple areas of responsibilities
- Can prioritize tasks effectively to ensure timely delivery of the project
-Position in Guadalajara Metro Area at Oracle Mexico Development Center
Career Level - IC3