Job Summary:
Tata Consultancy Services is seeking a Senior Engineer specialized in CockroachDB. The role involves designing, deploying, and managing multi-region CockroachDB clusters, ensuring high availability and performance while troubleshooting complex database issues.
Responsibilities:
• Design, deploy, operate, and scale multi-region CockroachDB clusters in production environments
• Ensure high availability, fault tolerance, and data consistency for globally distributed clusters
• Monitor cluster health, latency, replication status, and resource utilization using observability tools
• Perform capacity planning and proactive scaling for future growth
• Troubleshoot complex database and infrastructure issues including:
• Node failures
• Network partitions
• Leaseholder and range imbalance
• Replication lag
• Hotspotting
• High latency / throughput bottlenecks
• Design disaster recovery strategies (multi-region, backup/restore, failover/fallback)
• Implement and test backup, restore, and point-in-time recovery processes
• Automate provisioning, scaling, patching, and upgrades of CRDB clusters
• Perform rolling upgrades with zero or near-zero downtime
• Optimize SQL query performance and database schema efficiency
• Create operational runbooks, SOPs, and on-call playbooks for CRDB
• Participate in on-call rotations and incident response for production clusters
Qualifications:
Required:
• 8 - 10 Years of experience
• Design, deploy, operate, and scale multi-region CockroachDB clusters in production environments
• Ensure high availability, fault tolerance, and data consistency for globally distributed clusters
• Monitor cluster health, latency, replication status, and resource utilization using observability tools
• Perform capacity planning and proactive scaling for future growth
• Troubleshoot complex database and infrastructure issues including Node failures, Network partitions, Leaseholder and range imbalance, Replication lag, Hotspotting, High latency / throughput bottlenecks
• Design disaster recovery strategies (multi-region, backup/restore, failover/fallback)
• Implement and test backup, restore, and point-in-time recovery processes
• Automate provisioning, scaling, patching, and upgrades of CRDB clusters
• Perform rolling upgrades with zero or near-zero downtime
• Optimize SQL query performance and database schema efficiency
• Create operational runbooks, SOPs, and on-call playbooks for CRDB
• Participate in on-call rotations and incident response for production clusters
Company:
Tata Consultancy Services is a business solutions company that specializes on information technology services and consulting. It is a sub-organization of Tata Group. Founded in 1968, the company is headquartered in Mumbai, IND, with a team of 10001+ employees. The company is currently Late Stage.