Staff Software Engineer, ML Performance & Systems

Company: Fal
Location: San Francisco
Posted on: February 3, 2025

Job Description:

Key Responsibilities:

- Help fal maintain its frontier position on model performance for generative media models.
- Design and implement novel approaches to model serving architecture on top of our in-house inference engine, focusing on maximizing throughput while minimizing latency and resource usage.
- Develop performance monitoring and profiling tools to identify bottlenecks and optimization opportunities.
- Work closely with our Applied ML team and customers (frontier labs in the media space) to make sure their workloads benefit from our accelerator.

Requirements:

- Strong foundation in systems programming, with expertise in identifying and fixing bottlenecks.
- Deep understanding of the cutting-edge ML infrastructure stack (anything from PyTorch, TensorRT, and TransformerEngine to Nsight), including model compilation, quantization, and serving architectures. Ideally, you closely follow developments in all these systems as they happen.
- A fundamental understanding of the underlying hardware (NVIDIA-based systems at the moment) and, when necessary, the ability to go deeper into the stack to fix bottlenecks (for example, custom GEMM kernels with CUTLASS for common shapes).
- Proficient in Triton, or willing to learn it given comparable experience in lower-level accelerator programming.
- New frontier: multi-dimensional model parallelism (combining multiple parallelism techniques, such as TP with context parallel / sequence parallel).
- Familiar with the internals of Ring Attention, FA3, and FusedMLP implementations.

Compensation:

- $180,000 - $500,000 + equity + comprehensive benefits package

What we offer at fal:

- Interesting and challenging work
- Work-life balance
- Competitive salary and equity
- Employee-friendly equity terms (early exercise, extended exercise)
- We are currently hiring in downtown San Francisco. We prefer to work in person, but we also offer remote work opportunities for exceptional candidates.
- We offer visa sponsorship and will help you relocate to San Francisco.
- Health, dental, and vision insurance (US)
- Regular team events and offsites
- 4 weeks of paid vacation

To Apply:

Reach out to hello@fal.ai with your resume and any relevant links to your work or publications.

