Generate Data from Scratch
This page explains how to generate data from just concepts of a dataset using Transformer Lab.
Step 1: Download the Generate From Scratch Plugin
- Go to the Plugins Tab.
- Use the filter by type generator to narrow down the list.
- Download the Generate From Scratch Plugin.

Step 2: Create a Generation Task
- Navigate to the Generator Tab.
- Click on Create Task.
- From the drop-down menu, select Generate from Scratch.
- A pop-up window will appear for configuring your generation task.
Step 2.1: Configure Your Task
Name Your Generation Task
- In the first tab of the pop-up window, enter a name for your generation task.
Plugin Configuration
- Move to the next tab labeled Plugin Config.
- Select the generation model from the options available:
- Options include various Claude and OpenAI models, or a local model loaded in the Foundation tab.
- Specify the number of samples you want to generate.
Entering Dataset Concepts
- Scenario: Describe the scenario for which you'd like to generate the data. e.g.
Less knowledgeable fans trying to know more about the basketball game.
- Task: Describe the task you'd like to generate the data for. e.g.
Answer questions about rules of basketball
- Input Format: Describe the input format for the data. e.g.
Questions about basketball rules
- Expected Output Format: Describe the output format for the data. e.g.
Answers to the questions about basketball rules

Step 3: Run the Task
- Once you have saved your evaluation task, click on the Queue button to start the generation process.
- When the generation is complete, the generated dataset will be visible under the Generated Tab in the Training Data section.

Step 4: Preview Your Data
- Go to the Generated in the Training Data section.
- Click on the dataset you generated to preview the data.
