Staff Site Reliability Engineer
 Palo Alto, CA
About is a leading provider of cloud-based software that simplifies, digitizes, and automates complex, back-office financial operations for small and midsize businesses. Customers use the platform to manage end-to-end financial workflows and to process payments, which totaled over $70 billion for fiscal 2019. The AI-enabled, financial software platform creates connections between businesses and their suppliers and clients. It helps manage cash inflows and outflows. The company partners with several of the largest U.S. financial institutions, more than 70 of the top 100 U.S. accounting firms, and popular accounting software providers. has offices in Palo Alto, California and Houston, Texas. For more information, visit or follow @billcom.

Professional Experience/Background to be successful in this role:

  • 10+ years experience as a production operations engineer with experience in debugging complex problems across the whole stack, networking (ex: Cisco, Nexus, F5 LTM), storage (ex: Pure, NetApp), and systems (ex: RHEL, Centos, Dell, Nutanix)
  • Expert with AWS services (certified SysOps Administrator or Solutions Architect preferred)
  • Experience with automating systems and infrastructure via Ansible, Puppet, or Chef, and CloudFormation or Terraform
  • 3+ years of automation experience (Python preferred)
  • 5+ years supporting production in a SaaS multi-tenant environment with a modern application framework (Resin/Tomcat/Java) with a highly-transactional database (Oracle/MySQL)
  • Experience with a variety of monitoring and application performance management tools (NewRelic, PagerDuty, Grafana, etc.)
  • Experience with regulatory compliance and bank-level security (PCI, SOC 1/2/3, SOX, bank audits, internal audits)
  • BS or MS degree in Management Information Systems or related discipline

Competencies (Attributes needed to be successful in this role):

  • Have the ability to effectively communicate decisions, ideas, designs, and operation of systems and services in a clear and concise manner
  • Both a generalist, capable of picking up and working with multiple, disparate systems, and an expert, having an ability to dive deep into specific topics and quickly master them
  • Have curiosity about how things work and love to share that knowledge with others
  • Have a passion for helping others and making their lives better, you do this by simplifying complex systems to make them understandable and operable
  • Team player - humble, hungry, and smart
  • Conceptual problem solving - drive projects
  • Industry knowledge - people come to you with questions/help
  • Influence - you are seen as a leader within the team and in the organization
  • Business thinking - seek to understand business needs and apply solutions using technology
  • Project and issue management - able to break down complex projects into bite-size chunks

Expected Outcomes:

  • Drive the migration from on-premise systems to the cloud
  • Help design and implement a highly available infrastructure to meet the needs of our growing and evolving product
  • Help measure and improve reliability and performance
  • Drive continuous improvement by reducing the amount of manual operational work
  • Coordinate with application engineering to drive new technology to support our growth and applications
  • Support a highly available environment as part of an on-call rotation Culture:
●      Humble – No ego
●      Fun –  Celebrate the moments
●      Authentic – We are who we are
●      Passionate – Love what you do  
●      Dedicated – To each other and the customer