Skip to main content
This is the carpentries repository for our funded project "How to Build FAIR Domain-Specific Datasets for fine tuning/training NLP models" Resources

    This is the carpentries repository for our funded project "How to Build FAIR Domain-Specific Datasets for fine tuning/training NLP models" Resources
    How to Build FAIR Domain-Specific Datasets for fine tuning/training NLP models
    • How to Build FAIR Domain-Specific Datasets for fine tuning/training NLP models
    • Key Points
    • Glossary
    • Learner Profiles
      • Reference
    Search the All In One page
    How to Build FAIR Domain-Specific Datasets for fine tuning/training NLP models
    %
  • Instructor View

    Summary and Setup
    1. Introduction to NLP tasks and Fine-Tuning
    2. Identifying and Collecting Domain-Specific Data
    3. Preprocessing Biomedical Text Data
    4. Annotation Strategies for Domain-Specific NLP Tasks
    5. Quality Assurance and Validation of Datasets
    6. Challenges and possible solutions to create datasets
    7. FAIRification of Domain-Specific Datasets

    • Key Points
    • Glossary
    • Learner Profiles
    • Reference

    See all in one page

    Introduction to NLP tasks and Fine-Tuning


    Identifying and Collecting Domain-Specific Data


    Preprocessing Biomedical Text Data


    Annotation Strategies for Domain-Specific NLP Tasks


    Quality Assurance and Validation of Datasets


    Challenges and possible solutions to create datasets


    FAIRification of Domain-Specific Datasets



    This lesson is subject to the Code of Conduct

    Edit on GitHub | Contributing | Source

    Cite | Contact | About

    Materials licensed under CC-BY 4.0 by the authors

    Template licensed under CC-BY 4.0 by The Carpentries

    Built with sandpaper (0.16.12), pegboard (0.7.9), and varnish (1.0.7.9000)


    Back To Top