Production Support Engineer
About QuinStreet: www.quinstreet.com
QuinStreet is a leading digital marketing and media company that empowers millions of people to make informed decisions and connect with the products and services they need. Our innovative technology solutions and data-driven approach ensure our clients achieve their marketing goals efficiently and effectively
Experience: 2-4 Years
Location: Remote / On-site (depending on the role and region)
Job Description:
We are seeking a motivated and skilled Production Support Engineer to join our RevOps team. In this role, you will ensure the continuous uptime of our application systems by monitoring alerts, managing production incidents, and supporting product operations. You will collaborate with cross-functional teams to implement effective alerting strategies, manage post-mortems, and drive improvements to prevent future issues.
As part of the RevOps team, you will be responsible for owning the post-mortem process for outages and ensuring thorough documentation is maintained. Additionally, you'll participate in on-call support for addressing issues related to applications or data pipelines.
Key Responsibilities:
- Alerting and Monitoring: Use alerting tools to monitor application performance. Build and configure alerts to proactively detect and address production outages.
- Cross-functional Collaboration: Work closely with the product, marketplace, and business teams (client and media) to ensure comprehensive alerting coverage and swift resolution of production outages.
- Incident Management: Respond to and resolve escalated production issues, ensuring minimal disruption to business operations and customer experience.
- Post-Mortem Ownership: Own the post-mortem process for production outages. Analyze incidents, document findings, and collaborate with relevant teams to implement corrective actions and avoid future similar issues.
- Documentation: Maintain thorough documentation of production incidents, actions taken, and lessons learned to improve incident response and application reliability.
Required Skills:
- SQL: Solid experience in SQL for querying databases, data manipulation, and troubleshooting.
- Redshift: Familiarity with Amazon Redshift for data warehousing and querying large datasets.
- MS SQL Server: Experience working with MS SQL Server for database management and querying.
- Python: Familiarity with Python scripting for automation and monitoring is a plus.
- Tableau: Proficiency in Tableau for data visualization and reporting.
- Excel: Strong skills in Excel, including advanced formulas and data analysis techniques.
Desired Skills:
- Alerting Tools: Experience working with alerting tools like Datadog to set up and monitor application and pipeline alerts.
- Familiarity with data engineering concepts and pipeline management.
- Experience with cloud-based environments (AWS, GCP, Azure).
Additional Information:
- On-call support: This role requires being on-call in rotational shifts (Day/Night) for a minimum of 2 days per week, including weekends and holidays, to address any issues related to applications or data pipelines. The role can be performed remotely as long as you have internet access.
- Vacation: Vacation plans must be coordinated with a substitute, as required.