Senior Applied Scientist, NLP/GenAI

Thomson Reuters Holdings Inc. • Eagan, MN • 3h ago

Document understanding is a foundational intelligence layer that powers every major capability across our legal AI platform-from search and information extraction to agentic reasoning in products like Westlaw, PracticalLaw, and CoCounsel. You'll build state-of-the-art semantic chunking, document enrichment, and knowledge graph construction systems that serve as the cognitive foundation multiple product teams depend on, working across authoritative legal, tax and accounting content and extraordinarily diverse customer data. This is a rare opportunity to solve publishing-quality research problems with immediate production impact-your innovations will directly shape how millions of legal professionals research, analyze, and reason over complex legal documents while advancing the capabilities that enable the next generation of intelligent legal AI agents. About the role As an Senior Applied Scientist you will: Innovate & Deliver:Design, build, test, and deploy end-to-end AI solutions for complex document understanding tasks in the legal domain. Develop advanced models for semantic chunking of lengthy, non-uniformly structured legal documents with adjustable granularity levels for different use cases. Build document enrichment systems that classify documents according to legal and customer-defined taxonomies and extract rich metadata. Create LLM-based knowledge graph construction pipelines that extract and link heterogeneous legal knowledge including citations, entities, and legal concepts across diverse legal content. Develop scalable synthetic data generation systems to support model training, simulate complex legal research queries and generate hallucination-free answers. Work in collaboration with engineering to ensure well-managed software delivery and reliability at scale. Evaluate & Optimize:Develop comprehensive data and evaluation strategies for both component-level and end-to-end quality, leveraging expert human annotation and synthetic data generation. Apply robust training and evaluation methodologies that balance model performance with latency requirements, particularly for SLM-based solutions. Apply knowledge distillation techniques to compress large models into efficient SLMs suitable for production deployment. Drive Technical Decisions:Independently determine appropriate architectures for challenging document understanding problems including: semantic chunking strategies that handle diverse document formats, preserve legal document structure, and adapt to different granularity needs; document classification approaches that work across varying legal taxonomies and generalize to customer-defined schemas; LLM-based knowledge extraction methods that handle challenges like citation recognition errors and @contextual references; multi-document reasoning architectures for generating synthetic multi-hop queries that reflect complex legal research patterns. Balance accuracy, efficiency, and scalability while solving real-world challenges like handling diverse document formats and content @types. Align & Communicate:Partner closely with Engineering and Product teams to translate complex legal document understanding challenges into scalable, production-ready solutions. Engage stakeholders across multiple product lines to deeply understand use case requirements, shaping objectives that align document understanding capabilities with diverse business needs including next-generation search and deep legal research. Advance the Field: Maintain scientific and technical expertise in one or more relevant areas as demonstrated through product deliverables, published research at top venues (e.g., ACL, EMNLP, ICLR, NeurIPS, SIGIR, KDD) , and intellectual property. About You PhD in Computer Science, AI, NLP, or a related field, or a Master's with equivalent research/industry experience 5+ years of hands-on experience building and deploying document understanding systems, information extraction pipelines, or knowledge graph construction usingdeep learning, LLMs and NLP methods Proven ability to translate complex document understanding problems into innovative AI applications that balance accuracy and efficiency Professional experience scaling yourself and leading through others, in an applied research setting Strong programming skills (e.g., Python) and experience with modern deep learning frameworks (e.g., PyTorch, Hugging Face Transformers, DeepSpeed) Publications at relevant venues such as ACL, EMNLP, ICLR, NeurIPS, SIGIR, KDD Technical Qualifications Deep understanding of document understanding fundamentals: document layout analysis, semantic chunking approaches beyond fixed-size or paragraph-based methods, document classificationhandling hierarchical taxonomies, imbalanced multi-label classification, and adapting to domain-specific schemas Expertise in knowledge extraction and knowledge graph construction: entity recognition and linking, relation extraction, citation parsing, and building graph representations from unstructured text Expertise in LLM-based information extraction, few-shot and multi-task learning, post-training and knowledge distillation Solid understanding of synthetic data generation techniques for NLP, including query - answer generation with verification and scalable data augmentation for training specialized models Solid understanding of efficiency optimization including knowledge distillation, model compression, and designing SLM-based solutions that balance performance with computational constraints Solid understanding of DL/ML approaches used for NLP tasks Experience designing annotation workflows, creating high-quality labeled datasets with clear guidelines, and developing evaluation frameworks for document understanding tasks Preferred Qualifications Prior work on legal document understanding, legal information extraction, knowledge representationincluding legal citations and legal domain concepts or legal AI applications Prior work handling complex document structures common in legal documents: non-uniform formatting, nested hierarchies, cross-references, and embedded elements Experience with building systems that perform analysis, question answering or retrieval across large document collections Experience with knowledge graph frameworks and methodologies for legal or enterprise applications Understanding of RAG and agentic workflows for enterprise knowledge Publications at relevant venues such as ACL, EMNLP, ICLR, NeurIPS, SIGIR, KDD Experience working with AzureML or AWS SageMaker #LI-LP2 What's in it For You? Hybrid Work Model: We've adopted a flexible hybrid working environment (2-3 days a week in the office depending on the role) for our office-based roles while delivering a seamless experience that is digitally and physically connected. Flexibility & Work-Life Balance: Flex My Way is a set of supportive workplace policies designed to help manage personal and professional responsibilities, whether caring for family, giving back to the community, or finding time to refresh and reset. This builds upon our flexible work arrangements, including work from anywhere for up to 8 weeks per year, empowering employees to achieve a better work-life balance. Career Development and Growth: By... For full info follow application link. As a global business we rely on diversity of culture and thought to deliver on our goals. To ensure we can do that, we seek talented, qualified employees in our operations around the world regardless of race, color, sex/gender, including pregnancy, gender identity and expression, national origin, religion, sexual orientation, disability, age, marital status, citizen status, veteran status, or any other protected classification under country or local law. Thomson Reuters is proud to be an Equal Employment Opportunity Employer providing a drug-free workplace.

Similar Jobs

PI R&D Engineering Intern

Boston Scientific • Maple Grove, MN • 3h ago

IT & Engineering

Additional Location(s): N/A Diversity - Innovation - Caring - Global Collaboration - Winning Spirit- High Performance At Boston Scientific, we'll give you the opportunity to harness all that's within you by working in te...

3h ago

Go to Apply

Systems Assembly Technician

Insight Global • Eden Prairie, MN • 3h ago

IT & Engineering

Job Description Insight Global is seeking a Systems Assembly Technician to join a team responsible for building and repairing advanced mechanical test systems. This hands-on role requires technical expertise and the abil...

3h ago

Go to Apply

Graduate Environmental Engineer

GHD • Saint Paul, MN • 3h ago

IT & Engineering

Job Description There's no pledge more important than the one we make to look after our environment, and we're committed to helping you do exactly that. Join us as we strive to ensure a sustainable future for our communi...

3h ago

Go to Apply

Director, Software Engineering

Oracle • Saint Paul, MN • 4h ago

IT & Engineering

Job Description Lead, mentor, and manage a team of engineers focused on software development, operations, and distributed systems. Oversee the design, development, testing, debugging, and deployment of software applicati...

4h ago

Go to Apply

Senior Software Engineer

TSI Incorporated • Shoreview, MN • 4h ago

IT & Engineering

Who Are We? Are you someone that likes new challenges? Do you want to explore new technologies and help solve some of the world's most difficult problems, like air quality and safety solutions? At TSI, these opportunitie...

4h ago

Go to Apply

Software Engineer in Test

Beacon Hill Staffing Group, LLC • Saint Paul, MN • 4h ago

IT & Engineering

Beacon Hill was founded to set a new standard in search, career placement and flexible staffing. This is a backend-focused QA automation and testing infrastructure role. Looking for experienced server-side engineers (Jav...

4h ago

Go to Apply

Senior Software Engineer - Incydr

MIMECAST NORTH AMERICA INC • Minneapolis, MN • 4h ago

IT & Engineering

Senior Software Engineer - Incydr Product Development Mimecast is at the forefront of the cybersecurity industry, delivering innovative solutions to protect businesses and individuals from evolving threats. Our mission i...

4h ago

Go to Apply

Software Engineer II

MIMECAST NORTH AMERICA INC • Minneapolis, MN • 4h ago

IT & Engineering

Build Engineer II Mimecast is at the forefront of the cybersecurity industry, delivering innovative solutions to protect businesses and individuals from evolving threats. Our mission is to empower organizations with secu...

4h ago

Go to Apply

Network Systems Engineer

McKesson Corporation • Minneapolis, MN • 11h ago

IT & Engineering

McKesson is an impact-driven, Fortune 10 company that touches virtually every aspect of healthcare. We are known for delivering insights, products, and services that make quality care more accessible and affordable. Here...

11h ago

Go to Apply

Senior Engineer Supervisor - Electrical

University of Minnesota - 15th Ave • Minneapolis, MN • 12h ago

IT & Engineering

Apply for Job Job ID370830 LocationTwin Cities Job FamilyCampus Operations Full/Part TimeFull-Time Regular/TemporaryRegular Job Code9742BD Employee ClassAcad Prof and Admin Add to Favorite Jobs Email this Job About the J...

12h ago

Go to Apply

Subscribe to get new jobs sent directly to you
(no charge, cancel anytime)

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.