This job board retrieves part of its jobs from: Toronto Jobs | Emplois Montréal | IT Jobs Canada

Job offers selected exclusively for people in Texas

To post a job, login or create an account |  Post a Job

Operations Reliability Engineer

Danta Technologies

This is a Full-time position in austin, tx posted March 18, 2023.

 Title: Opeartions Reliability EngineerLocation: Austin, TX (Onsite)Duration: Long TermThe conversational Engineering team seeks a highly motivated individual with a background in Software Development. In this position, you’ll help craft a seamless, high-quality experience for our customers. Join our team, and you’ll help Apple ensure our customers get the products they want as effortlessly as possible.The Conversational Reliability Operations Engineer is a natural leader and facilitator; a strategic thinker who can “connect the dots” at multiple levels; is driven, organized, and detail-oriented; communicates with ease at all levels; is adept at facilitating actions and resolving conflicts; manages through relationships and influence; and displays grace under fire.Key Qualifications:Experience with DevOps, service reliability engineering(SRE), issue triaging, and troubleshooting experienceExperience with Automation skills using Ansible, Jenkins, and puppetHands-on experience with/CD tools and building pipelines using Jenkins or any other toolContainer tools & Orchestrator systems like Docker and Kubernetes.Messaging Queues systems like KafkaGood hands-on Splunk build dashboards using complex queriesExcellent debugging skills: ability to quickly recognize patterns in failuresExperience with scripting languages such as Python, Perl, shell scripts, etc.Strong verbal and written communication skills and knowledge to coordinate with multiple technical and functional usersSelf-motivated with excellent time management skillsHigh attention to detail, and you are good at finding edge cases.Additional skills a plus, not required:You know iOS and are familiar with the features of the Messages application.You are familiar with essential Machine Learning and NLP conceptsYou’ve worked with Chatbots, Conversational AI, or IVR systemsResponsibilities:Drive critical incidents with cross-functional teams across regions.Use operational tools to monitor performance against the ecosystem, identifying trending issues and escalating any problems to the appropriate team(s). Follow the progress and work with the proper group (s) to identify the root cause and resolve each issue.Look for improvement and automation opportunities to preserve and enhance customer experience.Where you identify customer experience as sub-optimal, work with cross-functional teams to improve quality and reliability. Influence for improvements based on a solid understanding of expected and desired experienceIn an impactful production incident, ensure the issue is triaged and troubleshot by the correct teams and mitigated as soon as possible to restore customer experience. Ensure the root cause is understood.Able to perform code fixes for problems and support activities such as incident trend analysis under minimum supervisionEnsuring resilience through proactive testing and preventive maintenanceSetting up monitoring and alert mechanisms to address issues before they become problemsIdentify learnings or opportunities for improvement and influence for change as part of your quality and reliability strategy.Provide constructive feedback for testability and suitable solutions, relying on data to justify technical decisions.Create and maintain Operations-related documentation and processes for the role in a central location.Effectively document and communicate standards to platform users.Experience:BE/B Tech in Computer Science, 5+ years of experience in the software industryProven work experience in software development and operationsStrong knowledge of infrastructure deployment methodologies, tools, and processesSolid understanding of operations and infrastructure, and scriptingExperience with performance and security testing is a plusUp-to-date on the latest industry trends; able to articulate trends and potential clearly and confidently