Skip to content

Fine-tune for a Domain Text Generation #111

Description

@Aminolroaya

Hi I want to fine-tune BioGPT in HuggingFace for Q&A in specific domain. I am using BioGptForCausalLM.from_pretrained("microsoft/biogpt")

When I train the model using Trainer.Train(), I get the error "The model did not return a loss from the inputs, only the following keys: logits,past_key_values. For reference, the inputs it received are input_ids,attention_mask."

Which checkpoint call should I use?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions