Theia

Article

Uber Develops HiveSync for Cross-Region Data Synchronization and Disaster Recovery

DATA AND AI INFRASTRUCTURE

Uber has introduced HiveSync, a sharded batch replication system designed for synchronizing Hive and HDFS data across regions, processing millions of events daily. This system enhances data consistency and supports disaster recovery while minimizing idle hardware costs, featuring components like the HiveSync Replication Service and Data Reparo Service for real-time change capture. Future developments aim to extend HiveSync for cloud replication as analytics and machine learning transition to Google Cloud.

Uber Develops HiveSync for Cross-Region Data Synchronization and Disaster Recovery
Jan 20, 2026, 6:09 AM

No comments yet. Be the first to share your thoughts!