Provide Examples and Link a Dataset
In this part of the tutorial, we explore how to enhance the model’s understanding and responses by using a dataset filled with examples. Initially, we worked with a single complaint. Now, by introducing a broader range of examples, the model can better grasp variations and details, leading to more precise categorizations.
Import a Pre-Loaded Hugging Face Dataset #
Datasets are crucial for complex tasks or further refining (fine-tuning) your model.
- Go to the Datasets tab. This is where you can manage different datasets.
- Select New Dataset.
- In Dataset type, select the Hugging Face tab.
- Fill in the details:
- Dataset name:
determined-ai/customers-complaints
- Description:
Customer complaints pre-processed and classified into issue categories
- Config name: Leave this field blank. This dataset does not have multiple configurations, or sub-parts. See Hugging Face Configurations for more details.
- Dataset name:
- Select Create Dataset.
Your dataset is now ready to be linked to your project!
Link the Dataset #
Connecting your dataset to the model allows it to use the real data during the response generation.
-
Go to the Snapshots tab and select your saved snapshot to open the playground.
-
Open the dataset drawer, then in Load dataset, choose the
determined-ai/customers-complaints
dataset. -
Preview the dataset samples to understand the data structure, then select Load.
The dataset feature columns like
Product
andConsumer_complaint_narrative
are essential for your next steps.
Set Up Your Prompt #
Provide clear instructions to the model on how to use the data:
Instruction
You are a customer support expert in a financial sector. Your task is to classify customer complaints
as related to a particular product from the following list. Don't include any explanation.
Only respond with one product from the numbered list.
Categories:
1. Credit reporting, credit repair services, or other personal consumer reports
2. Credit card or prepaid card
3. Credit card
4. Student loan
5. Debt collection
6. Checking or savings account
7. Credit reporting or other personal consumer reports
8. Vehicle loan or lease
9. Payday loan, title loan, or personal loan
10. Mortgage
Examples
Below are examples:
Complaint: {{Consumer_complaint_narrative}}
Answer: {{Product}}
Input
Complaint: {{Consumer_complaint_narrative}}
Answer:
Expected Output
{{Product}}
Generate and Review the Model’s Response #
- Select Generate.
- Review the generated output and check how accurately the complaints are categorized.
By providing examples and linking a dataset, we can see the new output contains the original linked dataset with new columns including a column where the model has categorized the complaint.
Save Your Progress #
After adjustments and testing:
- Select Save Snapshot.
- Select Save as New Snapshot and name it
customer complaints snapshot2
to capture this state. - Select Save.
Continuing to refine your model might involve crafting more complex prompts or conducting additional fine-tuning.
Recap #
- Provide Examples and Link a Dataset: By incorporating examples and connecting a dataset, you enabled the model to understand and process a broader range of data. This step is crucial for enhancing the model’s ability to categorize information accurately.