Remote Source

    AI Observability & Governance Engineer – Agentic ERP Platform

    Remote Malaysia
    Full-Time
    Mid (3-6 yrs)
    Engineering & Development
    Posted on June 3, 2026

    About Rimini Street, Inc.

    Rimini Street, Inc. (Nasdaq: RMNI), a Russell 2000® Company, is a proven, trusted global provider of end-to-end, mission-critical enterprise software support, managed services and innovative Agentic AI ERP solutions, and is the leading third-party support provider for Oracle, SAP and VMware software.

    Our comprehensive portfolio of unified solutions help run, manage, support, customize, configure, connect, protect, monitor, and optimize enterprise application, database and technology software, enabling our clients to achieve better business outcomes, significantly reduce costs and reallocate resources towards strategic projects.

    The Company has signed thousands of contracts with Fortune Global 100, Fortune 500, midmarket, public sector and government organizations who selected Rimini Street as their trusted, proven mission-critical enterprise software solutions provider and achieved better operational outcomes, realized billions of US dollars in savings and funded AI and other innovation investments.

    We are actively seeking an Observability & Governance Engineer – Agentic ERP Platform. This hybrid role is based in our Selangor or Penang office.

    Position Summary

    The Observability & Governance Engineer owns the consumer side of Rimini Street’s Agentic ERP Platform observability and audit infrastructure — the dashboards, alerts, compliance evidence chain, audit query tooling, and customer-facing reporting that turn platform telemetry into something auditors, customer security teams, and operations leaders can actually use. This role makes the platform’s emitted signals visible, queryable, and defensible.

    Reporting to the Security & Identity Lead, this engineer partners closely with the security control plane — producing the audit evidence and compliance posture that prove platform controls are working, and operating the LLM and operational observability that surfaces issues to support, security, and customer-facing teams. The role sits at the boundary between platform telemetry and the people who depend on it — auditors who need SOX-grade evidence, customer security teams who need posture reporting, support teams who need actionable alerts, and the Indemnification Control Owner who needs integrated compliance status. The ideal candidate combines hands-on observability engineering with a structured approach to compliance evidence and a track record of building dashboards that are actually used.

    Essential Duties & Responsibilities

    Compliance Evidence & Audit Chain

    • Operate the dual-stream audit logging architecture (operational telemetry plus immutable compliance records) and ensure every agent action produces a complete, queryable audit chain.
    • Build audit query tooling that lets internal teams and customer auditors trace any agent action back through its decision chain: session, tool invocation, authorisation decision, policy version, before/after state.
    • Produce SOX-grade compliance evidence packages on demand for client audits and regulatory reviews — supporting the Security & Identity Lead’s accountability for platform compliance posture.
    • Implement and maintain audit log retention, immutability guarantees, and access controls aligned to client and regulatory requirements.
    • Support the Indemnification Control Owner with quarterly integrated configuration audit reports covering all monitored vendor indemnification conditions.

    Dashboards & Customer-Facing Reporting

    • Build and maintain operational dashboards covering platform health, agent activity, policy decisions, model usage, cost, and quality signals.
    • Design and deliver customer-facing posture reports: platform health, security status, audit completeness, and SLA compliance.
    • Build alert routing and escalation policies that turn signal noise into actionable operational events for support and engineering teams.
    • Maintain release advisory generation and posture reporting workflows for client distribution.

    LLM Observability Operations

    • Operate the LLM observability layer (LangFuse self-hosted) and ensure complete capture of prompts, responses, costs, and quality signals.
    • Build dashboards for token usage, model cost, latency distributions, and quality drift across the model gateway.
    • Implement cost governance reporting: per-customer cost tracking, budget alerts, and cost optimisation insights for the AI/ML Lead.
    • Coordinate with the AI/ML Lead on model quality and evaluation signal integration into operational dashboards.

    Signal Consumer Integration

    • Operate as the primary consumer for signals produced by Platform Runtime and Platform Experience engineering — audit records, OPA policy decisions, OpenTelemetry traces, LangFuse LLM traces, model gateway cost events, agent activity signals.
    • Build the compliance chain integration that aggregates these signals into the artefacts compliance and audit teams require.
    • Maintain runtime security operations: vulnerability monitoring triage, air-gap update distribution, client patch compliance tracking.
    • Provide signal consumer feedback to Platform Runtime and Platform Experience teams when signal shape, content, or completeness gaps are detected.

    Cross-Hub Collaboration

    • Partner with the Observability Specialist (India) on telemetry pipeline reliability — they own the infrastructure (OpenTelemetry collectors, Jaeger, Prometheus, OpenSearch), this role consumes and packages the output.
    • Partner with the Platform Runtime Engineer (India) on Restate audit integration, dual-stream audit completeness, and durable execution evidence.
    • Partner with the AI/ML Lead (Brazil) on LLM observability scope, evaluation signal integration, and model cost governance.
    • Partner with the QA Testing Lead (Malaysia) on audit log validation and SOX-grade evidence completeness standards.
    • Partner with US Delivery on customer-facing audit responses, security questionnaires, and posture report distribution.

    Experience

    • 6+ years of observability, SRE, or platform operations experience, with at least 2 years owning customer-facing or audit-facing reporting.

    • Hands-on experience operating distributed tracing, metrics, log aggregation, and dashboard tooling at production scale.
    • Proven experience producing compliance evidence for SOC 2, ISO 27001, SOX, or equivalent regulatory frameworks.
    • Experience with multi-tenant SaaS or enterprise platforms where customer-facing reporting is a product requirement.
    • Background in enterprise software, regulated industries, or B2B platforms preferred.

    Technical Skills

    Required

    • Observability stack: OpenTelemetry, Jaeger, Prometheus, Grafana, OpenSearch.
    • Log aggregation, structured logging, and trace correlation across distributed services.
    • Dashboard construction: Grafana panel design, alert rule authoring, dashboard-as-code workflows.
    • PostgreSQL for compliance audit storage; SQL fluency for audit query construction.
    • Python and/or Go for tooling and integration development.
    • Git version control and CI/CD practices for dashboard and alert configuration.
    • Container and Kubernetes operations: pod logs, service mesh telemetry, observability sidecar patterns.
    • Compliance frameworks: SOC 2, ISO 27001, SOX evidence requirements

    Preferred

    • LLM observability tooling: LangFuse, OpenTelemetry GenAI semantic conventions.
    • Experience with policy-as-code platforms (OPA) and OPA decision log consumption.
    • Experience with durable execution audit integration (Restate or equivalent).
    • Familiarity with SIEM platforms and security event correlation.
    • Experience with air-gap deployment scenarios and disconnected operations.
    • Cost observability and FinOps tooling for cloud and LLM cost management.
    • Familiarity with AI assurance frameworks (AIUC-1 or equivalent).
    • Experience with vulnerability management and posture reporting tooling.

    Skills & Competencies

    • Evidence-oriented; understands that compliance is about producing defensible records, not just collecting data.

    • Customer-facing maturity; can build reports and dashboards that withstand scrutiny from external auditors and client security teams.
    • Strong analytical skills; can design dashboards that surface signal and suppress noise.
    • Collaborative; works effectively with platform engineers, security teams, support teams, and compliance leadership across multiple hubs and time zones.
    • Clear communicator; can explain telemetry, audit records, and compliance status to technical and executive audiences.
    • Self-motivated and effective in a remote environment.
    • Fluent in English (written and verbal).

    Desired Qualifications

    • Bachelor’s or Master’s degree in Computer Science, Information Systems, Cybersecurity, or related field.

    • Compliance or audit-related certifications (CISA, ISO 27001 Lead Auditor, or equivalent).
    • Experience in enterprise software companies, regulated industries (finance, healthcare), or B2B SaaS platforms.
    • Contributions to open-source observability tooling.

    Location & Travel

    Location: Hybrid - Selangor or Penang office

    Travel: Minimal; occasional travel for team meetings or training

    Language: Fluent English required (written and verbal)

    Why Rimini Street?

    We are looking for talented, passionate people to help us build our future at Rimini Street. We hire only the best, the most extraordinary professionals and provide compensation, bonuses, and benefits to match the skills of our top-performing team members. Do you thrive in a fast-paced environment, enjoy growing together, and get excited about learning new skills? Are you looking for an opportunity to make a true impact as part of a team of extraordinary professionals? This is the place for you.

    Our work is challenging and meaningful. We start and end each day with a sense of achievement and purpose guided by our core values, the Four Cs: 

    • Company
      • We dream big and innovate boldly.  
    • Colleagues
      • We work with extraordinary people who create a culture of mutual respect and collaboration. 
    • Clients
      • We relentlessly pursue solutions that help clients achieve their goals. Our unmatched client care is rooted in our passion for exceptional service. 
    • Community 
      • We believe in leaving the world a better place than we found it. With the Rimini Street Foundation, we’ve made positive impacts in six continents for over 425 charities.

    Accelerating Company Growth

    • Nasdaq-listed under ticker symbol RMNI since October 2017 
    • Over 6,300+ signed contracts to date, including Fortune 500 and Global 100 companies
    • Over 2,000 team members in 23 countries
    • US and international recognition for industry leadership and philanthropic efforts. See all of our awards and recognitions here: https://www.riministreet.com/company/awards/ 

    Rimini Street is committed to creating a diverse and inclusive environment and is proud to be an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to age, race, color, religion, national origin, sexual orientation, gender or gender identity, disability, protected veteran status, or any other characteristic protected by law. 

    To learn more about how Rimini Street is redefining the enterprise software support industry, visit http://www.riministreet.com 

    Please Note: Rimini Street does not accept resumes submitted by recruiting/staffing firms unless specifically requested by Human Resources.  Unsolicited resumes will be ineligible for referral fees.

    Company:  Rimini Street

    Delivers third-party enterprise software support and maintenance services.
    1001-5000 employees
    Software & IT Services
    HQ: United States