Responsibilities: • Building and maintaining a world-class systems infrastructure delivering high reliability and the best performance possible. • Ensuring cost-effective service delivery by automating critical processes, including server deployment, configuration, monitoring, and problem resolution. • Ensuring systems are secure and compliant with industry best practices. • Performing quick and accurate troubleshooting, diagnostic to production system. • Managing and tuning RDBMS and NoSQL DB servers including postgresql, mongodb and hbase. • Assisting with capacity planning and scalability to ensure systems are optimized for continuous growth. • Designing and implementing system automation architecture, infrastructure, and process using tools such as Salt, Puppet or Chef. • Working very closely with Engineering team ensuring new products and features meet Operational requirements. • Establishing a metrics-driven approach to tracking and continuously improving service quality and risk mitigation in every sense. • Providing occasional off-shift availability for Production issues or maintenance.
You: You should have a good experience in the system operation related field and be ready to take the responsibility of managing this important part of App Annie. • 3+ years experience in network and system engineering position. • Strong skills with Linux system (ubuntu, debian) administration including nginx/apache/haproxy. • Good DB operation experience on PSQL scripting, Postgres replication or Load Balancing is big plus. • Good Experience in large scale Hadoop cluster (HDFS, Mapreduce) operation/tuning/monitoring is a big plus. • Hands on experience in configuration management tool like (salt,chef,puppet). • Hands on experience on nagios,munin and other open source monitoring solutions. • Hands on experience in AWS cloud service (EC2, SQS, S3). • Strong RDBMS operation experiences (PostgreSQL, Mysql). Scale-out solution like database partiti