AppCard Inc. is a technology and marketing company headquartered in Manhattan, NY. AppCard's marketing tool uses data acquired at the point of sale (POS) through an advanced rewards program to create retargeting campaigns that help businesses increase their bottom line. AppCard is unique in the loyalty space due to its patented technology, which allows businesses to capture shopper identity and item-level data in real time from purchases made in store and online. The benefit is twofold. Consumers receive offers, incentives, and coupons. Through shoppers' interactions with these, AppCard's platform records and learns shopper behavior, giving grocers the ability to make their data actionable, increase average basket size, and systematically increase repeat purchases.

About the role:
At AppCard, we power AI-driven customer loyalty and marketing solutions, processing millions of daily transactions. We are looking for a Senior Site Reliability Engineer (SRE) to help establish and lead SRE best practices within our CloudOps team.
This is a hands-on role for an experienced professional who can maintain and optimize cloud production systems, ensure real-time monitoring, and implement robust security measures.

What you'll do:
- Maintain and optimize cloud production systems, ensuring high availability, reliability, and performance through robust monitoring, alerting, and backup and restore mechanisms
- Implement and manage security tools (must-have experience), proactively identifying vulnerabilities and enforcing security best practices to protect infrastructure and data
- Automate operations and build self-healing systems using Infrastructure as Code (IaC) tools like Terraform and Crossplane, reducing manual effort and improving system efficiency
- Enhance observability and monitoring with Datadog, Prometheus, and Grafana, developing intelligent alerting and anomaly detection to ensure system health and uptime
- Lead incident response and troubleshooting, ensuring swift recovery, conducting Root Cause Analysis (RCA), and driving continuous improvements in system stability
- Optimize and automate CI/CD pipelines with GitHub Actions, Azure DevOps, or ArgoCD, enabling smooth, reliable, and efficient deployments
- Strengthen system resilience and scalability, applying chaos engineering principles and designing architectures that support high-scale, production-critical environments

What you have:
- 5+ years of experience in SRE, DevOps, or Cloud Engineering, with a strong track record in managing and optimizing cloud infrastructure
- Proven ability to maintain and enhance cloud production environments, ensuring high availability and performance (AWS preferred)
- Expertise in security tools and best practices (mandatory), proactively identifying and mitigating vulnerabilities to protect infrastructure
- Strong experience in monitoring, alerting, and backup and restore, ensuring system reliability, quick incident resolution, and data protection
- Hands-on proficiency with Terraform, Kubernetes, and Infrastructure as Code (IaC) to automate deployments and infrastructure management
- Deep knowledge of incident response, troubleshooting, and system performance tuning, with the ability to diagnose and resolve complex issues efficiently
- Strong problem-solving mindset, thriving in fast-paced, production-critical environments, with the ability to balance operational stability and innovation
- Creative problem-solver with an innovative mindset
- A team player, self-motivated, fast learner
- Fluent in English