DevJobs

Staff Data Scientist, LLM Modeling

Overview
Skills
  • Python Python
  • SQL SQL
  • Linux Linux
  • Hive
  • SparkSQL
  • Vertica
Overview

Come join the GenAI team as a Staff Data Scientist!

We are building the Intuit Foundational LLM, as part of a proprietary Generative AI operating system (GenOS) platform.

What you'll bring

  • NLP knowledge and affinity to textual data
  • Deep interest in cutting-edge innovative technologies in Generative AI
  • Deep technical understanding of underlying DS concepts (not just training models)
  • Collaboration with partners across the globe, to deliver complex projects Maturity
  • Quick learner, adaptable, with the ability to work independently in a fast-paced environment
  • Strong verbal and written communication skills. Ability to conduct meetings and make professional presentations, and to explain complex concepts and technical material to non-technical users
  • Strong project management and stakeholder management skills
  • We welcome people who can deliver E2E AI projects (inception to production). We primarily use Python in all stages of development
  • Fluent in SQL enough to get the data you need from a warehouse (Vertica, Hive, SparkSQL)
  • Comfortable working in a Linux environment
  • Experience with building end to end reusable pipelines from data acquisition to model output delivery

How you will lead

  • You’ll apply proven methods and hacking skills in working with divergent data types, data scales, and big data — to explore and extrapolate data-driven insights using advanced, predictive statistical modeling and testing applied to data acquired and cleansed from a range of sources
  • You’ll use considerable expertise and independent judgment in collaborating with peers, data engineers, database managers, business analysts, architects, and product managers in designing and implementing the research strategy needed to methodically and iteratively structure, extract, cleanse, sample, test, validate, and communicate data-driven insights from complex sources and significant volumes of data for complex and unique business problems
  • You’ll provide guidance and support leadership to business leaders and stakeholders, on how best to harness available data in support of critical business needs and goals
  • You’ll lead the full cycle of iterative big data exploration, including hypothesis formulation, algorithm development, data cleansing, testing, insight generation, and visualization, and action planning
  • You’ll provide business stakeholders with entrepreneurial guidance essential for appropriately interpreting and building on findings, and fully exploiting the insights revealed through the research
Intuit