Analytics/

Data Reliability Engineer

Worldwide

Emerging Travel Group is a global travel-tech company whose brands have been operating in over 220 source markets since 2010. We specialize in developing advanced online booking platforms for all types of clients — from individual tourists to travel agents and companies organizing business trips. Our solutions empower hoteliers to effortlessly showcase their accommodations, boosting visibility and attracting a broader audience.

Our mission is to create, distribute, and operate the most convenient travel products. We constantly innovate and break the rules of the highly complex travel industry to make travel more widely available for individuals, more rewarding for professionals, and simpler for everyone.

We are looking for an experienced Data Reliability Engineer for our Data Engineering Group

 

Job Responsibilities

  • Monitor, support, and ensure high availability of data platform components: data lake, data warehouse, ETL/ELT pipelines, message queues (Kafka, RabbitMQ, etc.), and data streaming and batch processing tools (Spark, Flink, etc.).
  • Implement and maintain monitoring, alerting, and logging systems, and create dashboards (Prometheus, Grafana, ELK, DataDog, etc.).
  • Analyze incidents, troubleshoot failures, conduct postmortem analyses, and proactively identify and eliminate potential failure points.
  • Implement backup procedures and recovery tests of critical data platform components, and support disaster recovery plans.
  • Automate releases, configurations, and routine operations using scripts, CI/CD systems, and infrastructure as code tools (Terraform, Ansible, Helm, etc.).
  • Ensure the security of data infrastructure: manage access, conduct audits, apply patches, and implement compliance best practices.
  • Participate in scaling and updating the platform: set up new clusters, migrate components, and implement new service versions.
  • Collaborate with developers, analysts, data engineers, and other technical teams to integrate new solutions and solve problems together.
  • Provide and maintain up-to-date technical documentation on architecture, deployment templates, recovery procedures, and best practices.
  • Participate in the planning of platform development, and propose new technologies, tools, and process improvements.
  • Set and control SLAs/SLOs for key services and data processing pipelines, and work on reducing the mean time to recover (MTTR) incidents.
  • Develop and implement reliability and scalability strategies for the company's data platform (data lake, data warehouse, ETL/ELT processes, queues).
  • Design, implement, and develop architectural solutions that ensure high availability, fault tolerance, and security of the data infrastructure.
  • Implement and support monitoring, alerting, observability, and postmortem analysis processes at all levels of the data platform.

Key Qualifications

  • Experience in administering at least one analytical MPP database (Vertica, Greenplum, ClickHouse, or StarRocks);
  • Skills in supporting and operating analytical databases, data warehouses, and ETL/ELT pipelines;
  • Experience in setting up and maintaining monitoring and logging systems (Prometheus, Grafana, ELK, DataDog, or similar);
  • Proficiency with data streaming and batch processing tools (Kafka, RabbitMQ, Spark, Flink, etc.);
  • Experience in automating routine operations (scripts, CI/CD, IaC: Terraform, Ansible, Helm, etc.);
  • Knowledge of backup and disaster recovery practices;
  • Understanding of data infrastructure security principles (access, auditing, patching);
  • Experience collaborating with development teams, analysts, and data engineers;
  • Skills in maintaining technical documentation;
  • Conversational English at level B1 (intermediate).

Would be a plus:

  • Experience in supporting and/or investigating incidents/accidents with ETL tools;
  • Experience setting up security in databases/analytical platforms;
  • Experience integrating/interacting MPP with the big data stack (Hadoop, Spark, Trino).

We Offer You

  • we offer a fully flexible work schedule — there’s no pressure to start work at exactly 9:00 AM; what matters is achieving results and moving forward;
  • each person in our team is encouraged to choose their preferred work format. You can work fully remotely, come to the office, or choose a hybrid work model;
  • we are an ambitious and supportive team who love what they do, appreciate each other, and grow together;
  • the growth and development of each employee is our priority, so we have internal programs available for adaptation and training, development of soft skills and leadership abilities that are tailored individually to each employee;
  • we also provide partial compensation for employees participating in external training and conferences;
  • in tourism, it's difficult to grow without an excellent knowledge of English, and we support our employees' language learning goals — we organize group and individual lessons, plus speaking clubs with colleagues from all over the world;
  • and, of course, to encourage you to travel more, we offer corporate prices on hotels and other travel services;
  • we prioritize well-being and are committed to supporting the overall health and work-life balance at ETG. As part of this commitment, we provide MyTime Day Off — an extra day off that is designed to give our employees the flexibility to focus on important matters, whether it’s taking care of their health, mental recharge, addressing personal issues, or any other important activities.
Apply to this position

Or share with your friends