Career Profile
Accomplished technical leader with dual expertise in high-level software architecture and SRE/DevOps leadership.
As Head of SRE, I was responsible for system reliability, incident management, and building a proactive platform engineering function. My background as a Principal Software Engineer ensures solutions are designed for operability and scale from the outset.
I possess deep, hands-on skills in cloud infrastructure (AWS, GCP), Infrastructure as Code (Terraform), containerization (Docker, Kubernetes), Ruby/Go/Rust ecosystems and PostgreSQL database, which I leverage to mentor teams and direct technical strategy.
Experiences
- Led a high-stakes, full-infrastructure migration between GCP continents (São Paulo to Virginia). Successfully relocated three Kubernetes clusters, 30 PostgreSQL databases, and 40 Redis instances with under 2 hours of planned downtime and zero data loss.
- Managed a cross-functional migration team comprising SRE, DBRE, and infrastructure engineers. Orchestrated the entire process, from planning and testing to execution, ensuring a coordinated, single-night cutover.
- Owned overall infrastructure uptime and health, implementing monitoring, alerting, and incident response protocols to maintain high availability standards across all systems.
- Governed cloud spending and vendor contracts, implementing cost-control measures and strategic negotiations that kept infrastructure costs within budget while supporting business growth.
- Platform Modernization: Led the strategic migration from Google App Engine to Kubernetes, establishing a scalable, portable foundation for all company systems.
- CI/CD & Developer Experience: Owned the CI/CD pipeline (GitHub Actions, containers), focusing on speed, security, and reliability to accelerate development cycles.
- Production Platform Management: Operated and advanced the Kubernetes-based deployment platform, ensuring high availability and implementing GitOps workflows.
- Observability Leadership: Architected and implemented the full-stack observability solution (metrics, logs, traces) to drive system transparency and reduce troubleshooting time.
- Financial Ownership: Managed cloud infrastructure budgets, implemented cost-control measures, and realized significant savings through optimization initiatives.
- Team & Function Leadership: Built and managed the SRE/Platform team, establishing core reliability practices and a culture of automation.
- Infrastructure Strategy: Directed the seamless migration of critical services between cloud providers to optimize costs and resilience.
- Data Systems Ownership: Took full operational ownership of large PostgreSQL databases, ensuring performance, stability, and disaster recovery readiness.
- Project Delivery: Managed team capacity and priorities to guarantee the on-time delivery of complex infrastructure projects and platform initiatives.
- Led the Authorization Platform Team for the core Acquiring product, mentoring 4 engineers and owning the roadmap for a system processing an average of 3M+ transactions daily.
- Leveraged ISO8583 Expertise to design and maintain financial transaction processing systems, ensuring seamless and compliant communication with major Payment Network Schemes.
- Architcted Critical Network Integrations by developing and maintaining the proprietary connectors and socket-based services that interface directly with external payment networks, focusing on stability and millisecond-level performance.
- Optimized Database Performance by redesigning key tables and refining SQL queries, resulting in a constant response time and supporting a 500% increase in transaction volume without scaling infrastructure.
In this position, I'm member of a team responsible to maintain systems and APIs from main customer of company. My daily basis job includes develop features requested by customer, solve bugs and observe metrics (response times, resources usage, number of errors) to find which applications should be improved.
- Developed and maintained critical customer-facing APIs and systems using Go and Groovy/Java, directly supporting the company's primary revenue stream.
- Drove system reliability by implementing application instrumentation, metric collection, and monitoring of SLOs/error budgets. Used data-driven insights to prioritize and execute performance improvements.
- Engineered a robust development workflow through Test-Driven Development (TDD), comprehensive automated testing, and the creation of Docker-based CI/CD pipelines, significantly reducing regressions and streamlining deployments.
- Embraced DevOps responsibilities, including Docker image management, pipeline creation, and operational monitoring, ensuring high availability and performance of owned services.
Sole architect, developer, and operator of the company's core internal platform. Translated business needs (stock management, financial integration) into a complete, production-grade system that automated device rental logistics from end-to-end.
- Full-Product Lifecycle Ownership: Single-handedly managed the entire technical stack—from concept and UX design through development, deployment, and ongoing maintenance—for the mission-critical device rental management system.
- Business Process Automation: Developed and integrated all key modules: Stock Control (real-time inventory), Bank Integration (payment processing), and dual Admin & Customer Dashboards, directly replacing manual processes and improving operational accuracy.
- Full-Stack Ruby Expertise: Built a robust, scalable application using Ruby on Rails (RSpec, Sidekiq, Puma) with a PostgreSQL database, implementing automated testing, background job processing, and performance optimization.
- End-to-End Cloud Infrastructure: Designed, deployed, and managed the complete AWS production environment (EC2, RDS, S3, ELB, Route53), ensuring high availability, security, and scalability for the business-critical application.
In this position I was member of a team responsible to provision and mantain the ERP products from company's customers. As a R&D team, we tried lots of tools to discover which ones we needed to deliver a service with very high quality standards.
- Ruby specialist
- Development of orchestration software for internal products
- Frameworks (Rails / Sinatra)
- Tests automation (Rspec / Minitest)
- Web servers deployment (Puma / Unicorn / Passenger)
- Background jobs (Sidekiq)
- Cloud computing specialist
- Orchestration and automation of deployments into public Clouds (AWS/Azure)
- Cluster orquestration (Kubernetes / Docker)
- Infrastructure automation (Terraform)
- Configuration automation (Ansible)
- Monitoring and Logging (Datadog / LogDNA)
- Other Knowledges
- Databases (PostgreSQL)
- General Purpose Cache (Redis)
- Shell Scripting (Bash)
- Linux (Ubuntu / CentOS)
- Development of internal systems (Ruby ecossystem)
- Feature development
- Integration with external partners
- Integration with external services
- Infrastructure/Operations (AWS ecossystem)
- Development of Application Architecture (Load Balancers, Reverse Proxies, Application Servers)
- Maintenance of Application Architecture (Monitoring, House-Keeping, Deployment Automation)
- Internal infrastructure/Operations
- Internet links and High-Availability
- Internal Wi-Fi Maintenance and Monitoring
- Tech Lead/Team Management
- Team Lead
- Interface with Business Teams
- Expectations Management
- Technology definition
- Methodology definition and Process Maintenance
- Recruitment
Projects
This is a list of my side projects.