Research Software Engineer, Omega Data
Google
**Minimum qualifications:**
+ Bachelor’s degree in Computer Science, a related technical field, or equivalent practical experience.
+ 5 years of experience with software development in one or more programming languages (e.g., Python, C, C++, Java, Go).
+ 3 years of experience testing, maintaining, or launching software products, and 1 year of experience with software design and architecture.
**Preferred qualifications:**
+ Master's degree or PhD in Computer Science or a related technical field.
+ Publications in top machine learning conferences such as NeurIPS, ICML, ICLR, TACL, ACL, NAACL, EMNLP, COLM.
+ Experience in machine learning methods including foundation models and Large Language Models (LLMs).
+ Experience in Flume C++, including familiarity with some of the low level details of Flume.
+ Knowledge of Google Infrastructure, including Borg and CNS.
At Google, research-focused Software Engineers are embedded throughout the company, allowing them to setup large-scale tests and deploy promising ideas quickly and broadly. Ideas may come from internal projects as well as from collaborations with research programs at partner universities and technical institutes all over the world.
From creating experiments and prototyping implementations to designing new architectures, engineers work on real-world problems including artificial intelligence, data mining, natural language processing, hardware and software performance analysis, improving compilers for mobile platforms, as well as core search and much more. But you stay connected to your research roots as an active contributor to the wider research community by partnering with universities and publishing papers.
Google Research addresses challenges that define the technology of today and tomorrow. From conducting fundamental research to influencing product development, our research teams have the opportunity to impact technology used by billions of people every day.
Our teams aspire to make discoveries that impact everyone, and core to our approach is sharing our research and tools to fuel progress in the field -- we publish regularly in academic journals, release projects as open source, and apply research to Google products.
For United States Applicants:
The US base salary range for this full-time position is $166,000-$244,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.
Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about benefits at Google (https://careers.google.com/benefits/) .
**Responsibilities:**
+ Improve the scalability and reliability of the pipelines on the critical path of Gemini pre-training.
+ Develop new techniques to identify near duplicate data.
+ Research new methods of data curation.
+ Serve as Gemini Data Captain or captain new large scale Gemini training runs.
Google is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. See also https://careers.google.com/eeo/ and https://careers.google.com/jobs/dist/legal/OFCCP_EEO_Post.pdf If you have a need that requires accommodation, please let us know by completing our Accommodations for Applicants form: https://goo.gl/forms/aBt6Pu71i1kzpLHe2.
Por favor confirme su dirección de correo electrónico: Send Email