Skip to main content

Senior GenAI Specialist Solutions Architect, Amazon SageMaker Service GTM

Job ID: 2762280 | Amazon Web Services, Inc.

DESCRIPTION

Are you passionate about Generative AI (GenAI)? Do you want to help define the future of Go to Market (GTM) at AWS using generative AI? In this role, you will help some of our largest customers build, fine tune, and deploy Generative AI models using Amazon SageMaker, and help customers leverage these models to power large-scale end applications. You will engage with AWS product owners to influence product direction and help our customers tap into new markets by utilizing GenAI along with AWS Services.

At Amazon, we’ve been investing deeply in artificial intelligence for over 20 years, and many of the capabilities customers experience in our products are driven by machine learning. Amazon.com’s recommendations engine is driven by machine learning (ML), as are the paths that optimize robotic picking routes in our fulfillment centers. Our supply chain, forecasting, and capacity planning are also informed by ML algorithms. Alexa is fueled by Natural Language Understanding and Automated Speech Recognition deep learning; as is Prime Air, and the computer vision technology in our new retail experience, Amazon Go. We have thousands of engineers at Amazon committed to machine learning and deep learning, and it’s a big part of our heritage.

AWS is looking for a Generative AI Solutions Architect who will be the Subject Matter Expert (SME) for helping customers in designing solutions that leverage our Generative AI services. You will interact with customers directly to understand the business problem, help and aid them in implementation of generative AI solutions, deliver briefing and deep dive sessions to customers and guide customer on adoption patterns and paths for generative AI. As part of the Generative AI Worldwide Specialist organization, you will work closely with other Solution Architects from various geographies to enable large-scale customer use cases and drive the adoption of Amazon Web Services for GenAI services. You will interact with other Data Scientists and Solution Architects in the field, providing guidance on their customer engagements. You will develop white papers, blogs, reference implementations, and presentations to enable customers and partners to fully leverage Generative AI services on Amazon Web Services. You will also create field enablement materials for the broader technical field population, to help them understand how to integrate AWS Generative AI solutions into customer architectures. You drive effective feedback gathering from customers, and you distill and translate that feedback into clear business and technical requirements for product and engineering teams to review.

You must have deep technical experience working with technologies related to large language models (LLM) including LLM architectures, distributed training and inference, model evaluation, and fine-tuning techniques.

Candidates must have great communication skills and be very technical, with the ability to impress Amazon Web Services customers at any level, from executive to developer. You will get the opportunity to work directly with senior GenAI engineers and Data Scientists at customers, partners and Amazon Web Services service teams, influencing their roadmaps and driving innovation.

Travel up to 50% may be possible.

Key job responsibilities
You will help develop the industry’s best cloud-based solutions to grow the GenAI business. Working closely with our engineering teams, you will help enable new capabilities for our customers to develop and deploy GenAI workloads on AWS. You will facilitate the enablement of AWS technical community, solution architects and, sales with specific customer centric value proposition and demos about end-to-end GenAI on AWS cloud.

You will possess a technical and business background that enables you to drive an engagement and interact at the highest levels with startups, Enterprises, and AWS partners. You will have the technical depth and business experience to easily articulate the potential and challenges of GenAI models and applications to engineering teams and C-Level executives. This requires deep familiarity across the stack – compute infrastructure (e.g., Amazon EC2, EKS, SageMaker, Lustre), ML frameworks PyTorch, JAX, orchestration layers Kubernetes and Slurm, parallel computing (NCCL, MPI), MLOPs, as well as target use cases in the cloud.

You will drive the development of the GTM plan for building and scaling GenAI on AWS, interact with customers directly to understand their business problems, and help them with defining and implementing scalable GenAI solutions to solve them (often via proof-of-concepts). You will also work closely with account teams, research scientists, and product teams to drive model implementations and new solutions.

You should be passionate about helping companies/partners understand best practices for operating on AWS. An ideal candidate will be adept at interacting, communicating and partnering with other teams within AWS such as product teams, solutions architecture, sales, marketing, business development, and professional services, as well as representing your team to executive management. You will have a natural appetite to learn, optimize and build new technologies and techniques. You will also look for patterns and trends that can be broadly applied across an industry segment or a set of customers that can help accelerate innovation.

This is an opportunity to be at the forefront of technological transformations, as a key technical leader. Additionally, you will work with the AWS GenAI product teams to shape product vision and prioritize features for AI/ML Frameworks and applications. A keen sense of ownership, drive, and being scrappy is a must.

BASIC QUALIFICATIONS

- Bachelor's degree in computer science, engineering, mathematics or equivalent
- Experience developing technology solutions and evangelising end-to-end technology roadmaps that guide IT transformations toward cloud computing
- Experience in specific technology domain areas like software development, cloud computing, systems engineering, infrastructure, security, networking, data and analytics
- Experience communicating across technical and non-technical audiences and at C-level, including training, workshops, publications
- Practical experience in distributed training frameworks and inference servers. Orchestrators/schedulers (one or several of Kubernetes, EKS, Slurm), storage systems (S3, Lustre, POSIX). Experience working with GPUs or custom silicon, profiling and optimization.

PREFERRED QUALIFICATIONS

- Knowledge of distributed systems design and implementation or equivalent
- Knowledge of large scale automation and workflow management or equivalent
- Knowledge of presentations and whiteboarding skills with a high degree of comfort speaking with internal and external executives, IT management, and developers
- Experience architecting, migrating, transforming or modernizing customer requirements to the cloud
- Practical experience in High Performance Computing (HPC) and/or distributed training, performance profiling and optimization.
- Experience in distributed training (PyTorch, Jax, NeMo) and/or inference (NIMS, TRT-LLM, TorchServe, Triton).

Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit https://www.amazon.jobs/en/disability/us.

Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company’s reputation. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $138,200/year in our lowest geographic market up to $239,000/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits. This position will remain posted until filled. Applicants should apply via our internal or external career site.