Job Summary:
Databricks is a data and AI company that enables data teams to solve complex problems through its infrastructure platform. The Senior Software Engineer will build next-generation distributed data storage and processing systems, enhancing performance and supporting diverse workloads.
Responsibilities:
• Building the next generation distributed data storage and processing systems that can outperform specialized SQL query engines in relational query performance.
• Developing the de facto open source standard framework for big data.
• Providing reliable and high performance services and client libraries for storing and accessing humongous amount of data on cloud storage backends, e.g., AWS S3, Azure Blob Store.
• Creating a storage management system that combines the scale and cost-efficiency of data lakes, the performance and reliability of a data warehouse, and the low latency of streaming.
• Making it simple and possible to orchestrate and operate tens of thousands of data pipelines.
• Building the next generation query optimizer and execution engine that's fast, tuning free, scalable, and robust.
Qualifications:
Required:
• BS (or higher) in Computer Science, related technical field or equivalent practical experience.
• Comfortable working towards a multi-year vision with incremental deliverables.
• Motivated by delivering customer value and impact.
• 5+ years of production level experience in either Java, Scala or C++.
• Strong foundation in algorithms and data structures and their real-world use cases.
• Experience with distributed systems, databases, and big data systems (Apache Spark, Hadoop).
Company:
Databricks is a data and AI platform that unifies data engineering, analytics, and machine learning on a lakehouse architecture. Founded in 2013, the company is headquartered in San Francisco, USA, with a team of 5001-10000 employees. The company is currently Late Stage.