Pune, IND
1 day ago
Senior Software Engineer - Storage & Compute Stability, Pune
Location: Pune
Business Area: Engineering and CTO
Ref #: 10046170

**Description & Requirements**

**About the Team**

The Storage and Compute Stability Team is a trusted partner in ensuring the reliability, performance, and security of Bloomberg’s cloud storage and compute infrastructure. We operate at the intersection of infrastructure, software, and services, proactively identifying, solving, and preventing issues before they impact our users. Our focus is on streamlining processes, driving automation, and serving as a bridge between product teams and stakeholders. This enables Bloomberg’s engineers to innovate rapidly while maintaining stability at scale.

We follow agile practices and thrive in a collaborative environment where code reviews, design discussions, and brainstorming are part of our daily rhythm. The team is driven by curiosity, creativity, and a shared passion for building efficient, resilient systems.

This isn’t just another operations role: you’ll be embedded at the core of Bloomberg’s infrastructure. Our team spans infrastructure, software, and services, supporting both short-term needs and long-term strategic investments.

**You’ll have the opportunity to:**
+ Work on critical infrastructure and help define how it evolves
+ Take on meaningful projects that balance immediate impact with sustainable improvements
+ Join a culture that values innovation, automation, and continuous improvement

**We'll trust you to:**
+ Ensure system reliability and performance by monitoring, troubleshooting, and optimizing compute and storage services
+ Proactively identify issues and trends to prevent outages, reduce mean time to recovery (MTTR), and improve overall service availability
+ Collaborate with product owners, developers, and infrastructure teams to deliver scalable, long-term solutions
+ Automate operational processes such as deployments, monitoring, maintenance, and capacity management
+ Develop and maintain runbooks, reproducers, and documentation to support knowledge-sharing and workflow efficiency
+ Participate in on-call rotations to support critical infrastructure and respond to incidents
+ Contribute to infrastructure lifecycle management, including capacity forecasting, proactive refresh planning, and upgrades
+ Continuously explore opportunities to improve team processes and system stability

**What we value:**
Our work is guided by key principles that define how we operate:
+ Expertise – We invest in deep technical knowledge to solve complex infrastructure challenges
+ Proactivity – We anticipate issues before they occur and design systems to withstand failure
+ Collaboration – We build strong relationships with product teams and stakeholders to deliver end-to-end solutions
+ Efficiency – We reduce manual work through thoughtful automation and streamlined processes
+ Documentation – We believe in capturing and sharing knowledge to make systems transparent and maintainable

**What makes you successful:**
+ Strong communication and collaboration skills; the ability to explain technical concepts to diverse audiences
+ The ability to be self-motivated and autonomous; you take ownership of problems and drive them to resolution
+ Passion for continuous learning and working across a broad spectrum of systems and technologies
+ Being comfortable working in an agile environment, participating in daily standups, sprint planning, and code reviews
+ Curiosity, adaptability, and eagerness to work across the entire infrastructure stack

**You'll need to have:**
+ 5+ years of demonstrated experience working with object-oriented programming languages such as C/C++ and Python, and the willingness to work with Python as your primary language on the job
+ Experience with monitoring, logging, and observability tools
+ Understanding of containers and orchestration technologies
+ Solid knowledge of networking, operating systems, and distributed systems concepts
+ Experience participating in incident response and on-call support for production systems

**We'd love to see:**
+ Familiarity with cloud platforms (Ceph or OpenStack) and related compute/storage services
+ Experience with infrastructure-as-code tools (e.g., Terraform, Ansible)

**If this sounds like you:**
Apply if you think we're a good match. We'll get in touch to let you know what the next steps are, but in the meantime feel free to have a look at this: Tech at Bloomberg

Bloomberg is an equal opportunity employer and we value diversity at our company. We do not discriminate on the basis of age, ancestry, color, gender identity or expression, genetic predisposition or carrier status, marital status, national or ethnic origin, race, religion or belief, sex, sexual orientation, sexual and other reproductive health decisions, parental or caring status, physical or mental disability, pregnancy or parental leave, protected veteran status, status as a victim of domestic violence, or any other classification protected by applicable law.

Bloomberg is a disability inclusive employer. Please let us know if you require any reasonable adjustments to be made for the recruitment process.
If you would prefer to discuss this confidentially, please email amer_recruit@bloomberg.net