1

Scalability Jobs (NOW HIRING)

ABOUT POOLSIDE In this decade, the world will create Artificial General Intelligence. There will only be a small number of companies who will achieve this. Their ability to stack advantages and pull ...

Staff SRE & DeFi scalability lead

San Francisco, CA · On-site

$67.25 - $89.25/hr

This role is focused on ensuring our frontend UI, backend API's, and developer operations are as reliable and scalable as possible. Users need an intuitive, seamless DeFi experience, without ...

next page

Showing results 1-20

People also search for

Scalability information

What are the key skills and qualifications needed to thrive as a Scalability Engineer, and why are they important?

To thrive as a Scalability Engineer, you need a strong background in computer science, distributed systems, and performance optimization, often demonstrated by a relevant degree and experience with high-traffic applications. Familiarity with cloud platforms (like AWS or Azure), containerization tools (such as Docker and Kubernetes), and monitoring systems is typically required. Analytical thinking, problem-solving, and effective communication are essential soft skills for identifying bottlenecks and collaborating with cross-functional teams. These competencies are crucial for ensuring systems remain reliable, efficient, and responsive as user demand grows.

What is scalability in the context of technology jobs?

Scalability refers to a system's ability to handle increased load or demand by efficiently expanding its resources, such as computing power, storage, or network capacity. In technology jobs, ensuring scalability means designing software, hardware, or infrastructure that can grow seamlessly as user numbers or data volume increase. It is a critical factor for businesses that anticipate growth, as scalable systems can adapt without significant redesign or performance loss. Scalability can be achieved through techniques like load balancing, distributed computing, and modular architecture.

What are some common challenges faced by professionals focusing on scalability in software engineering teams?

Professionals responsible for scalability often face the challenge of designing systems that can handle rapid growth in users, data, or traffic without sacrificing performance or reliability. Balancing immediate business needs with long-term scalability planning requires close collaboration with product managers, developers, and operations teams. Additionally, identifying bottlenecks in existing architecture and implementing solutions that won’t disrupt current services can be complex and requires strong analytical and communication skills. Staying current with evolving technologies and best practices is also essential to ensure scalable and cost-effective solutions.

What is the difference between Scalability vs Network Engineer?

AspectScalabilityNetwork Engineer
Required CredentialsKnowledge of system architecture, cloud platforms, and infrastructureNetworking certifications (e.g., CCNA, CCNP), Cisco or similar expertise
Work EnvironmentIT infrastructure, cloud services, software developmentNetwork setup, maintenance, troubleshooting in enterprise or data center settings
Industry UsageTechnology, software development, cloud computingTelecommunications, IT services, enterprise networks
Comparison FocusDesigning systems to handle growth efficientlyConfiguring and managing network hardware and protocols

While scalability focuses on designing systems that can grow and adapt efficiently, network engineers specialize in building and maintaining the network infrastructure that supports such systems. Both roles are essential in IT, but they emphasize different aspects of technology growth and management.

More about Scalability jobs
Infographic showing various Scalability job openings in the United States as of June 2026, with employment types broken down into 96% Full Time, 1% Part Time, and 3% Contract. Highlights an 80% Physical, 7% Hybrid, and 13% Remote job distribution.
VP, Platform Architecture, Scalability & AI Automation

VP, Platform Architecture, Scalability & AI Automation

Resolution Technologies, Inc.

Norcross, GA • On-site

Full-time

This job post has expired today. Applications are no longer accepted.


Job description

IT Leader ??? Platform Architecture, Scalability & AI Automation Role Overview We are seeking a senior Technology Leader to own the architecture, scalability, and intelligent automation of our core FinTech platforms. This role is responsible for designing large-scale, cloud-native, distributed systems while leveraging AI-driven automation to improve platform efficiency, reliability, and operational scale.

The ideal candidate combines deep expertise in platform architecture and distributed systems with a strong point of view on using AI to automate infrastructure operations, optimize performance, and enable predictive, self-healing platforms. This is a highly technical leadership role with material influence over how the platform scales as the business grows. Key Responsibilities Platform Architecture & Technical Strategy Own the end-to-end platform architecture supporting core FinTech products and transaction flows Define architectural standards for scalability, performance, resiliency, and system composability Lead evolution from tightly coupled or monolithic systems toward distributed, service-oriented platforms Establish clear system boundaries, ownership models, and architectural governance Define and execute a multi-year platform roadmap aligned with growth, transaction scale, and product velocity Scalability & Distributed Systems Design platforms capable of handling high transaction volumes, burst traffic, and sustained throughput Guide horizontal scaling strategies across compute, storage, data, and messaging layers Lead architectural decisions around sharding, partitioning, caching, asynchronous processing, and concurrency Continuously improve latency, throughput, and resource efficiency across the platform Enable multi-region and multi-environment scalability where required Cloud & Infrastructure Architecture Architect cloud platforms (AWS, Azure, or GCP) optimized for scale, availability, and operational efficiency Define reference architectures for containerized workloads, microservices, and distributed runtimes Lead Kubernetes and container platform adoption and standardization Mature Infrastructure as Code (Terraform, CloudFormation, etc.) for consistent, scalable environments Own capacity modeling, growth forecasting, and infrastructure lifecycle planning AI-Driven Automation & Intelligent Platforms Apply AI and machine learning techniques to automate platform operations and decision-making Use AI for: Capacity forecasting and demand prediction Anomaly detection in platform performance and system behavior Automated root-cause analysis and incident correlation Predictive scaling and infrastructure optimization Drive adoption of self-healing platform patterns where systems can respond automatically to failure or degradation Enable data pipelines, feature stores, and runtime environments required to support AI-enabled platform services Partner with data and engineering teams to productionize AI capabilities within core platform workflows Platform Engineering & Developer Enablement Build shared platform capabilities that abstract complexity and enable product teams to scale independently Provide self-service infrastructure, golden paths, and opinionated platform tooling Standardize CI/CD, runtime environments, observability, and deployment patterns Reduce friction and cognitive load for application teams through strong platform design Measure and improve developer experience as a platform outcome Reliability, Performance & Intelligent Operations Lead SRE practices focused on scalability, automation, and operational maturity Define and track SLIs/SLOs centered on throughput, latency, availability, and platform health Establish advanced observability (metrics, tracing, logging) as inputs to AI-driven insights Lead analysis of scaling failures, performance bottlenecks, and systemic inefficiencies Drive continuous improvement toward predictable, automated, and resilient operations Required Qualifications 10+ years of experience designing and operating large-scale distributed systems 5+ years in senior technical leadership roles (Director, Principal, VP, or equivalent) Deep expertise in platform architecture, cloud-native design, and system scalability Strong hands-on experience with AWS, Azure, or GCP Proven experience with microservices, event-driven architectures, and distributed data systems Solid background in Infrastructure as Code and automation-first platform design Experience applying AI/ML concepts to operational or platform use cases Preferred Qualifications Experience with high-volume transaction processing or real-time systems Strong Kubernetes and container platform experience Experience with event streaming platforms (Kafka or equivalent) Background modernizing legacy platforms at scale Experience with AI-assisted operations, AIOps, or intelligent monitoring platforms Key Competencies Systems-level architectural thinking with a strong scalability mindset Ability to blend platform engineering and AI automation into practical solutions Technical credibility with senior engineers, architects, and leadership Pragmatic decision-maker who balances ideal architecture with real-world constraints Strong communicator who can translate technical strategy into business impact 30-60-90 Day Success Plan First 30 Days ???

Understand & Assess Develop deep understanding of current platform architecture and scaling limits Review system topology, transaction paths, and performance characteristics Identify opportunities for automation, AI-driven optimization, and architectural simplification Build strong relationships across engineering, data, and product leadership Days 31???60 ??? Architect & Automate Define target-state platform architecture with explicit scalability patterns Prioritize architectural improvements with the highest scale and automation leverage Introduce AI-enabled insights into observability, capacity, or incident analysis Establish platform standards, reference architectures, and design principles Days 61???90 ??? Scale & Industrialize Deliver measurable improvements in throughput, latency, and platform stability Advance automation toward self-service, self-scaling, and self-healing capabilities Roll out platform-level AI automation for operations and performance optimization Finalize a multi-year platform and AI-automation roadmap Establish a culture of building intelligent systems designed to scale by default #RT #DICEJOBS