Generate Data from Raw Text
This page explains how to generate data from raw text using Transformer Lab.

Step 1: Download the Generate From Raw Text Plugin
- Go to the Plugins Tab.
- Use the filter by type generator to narrow down the list.
- Download the Generate From Raw Text Plugin.
Step 2: Create a Generation Task
- Navigate to the Generator Tab.
- Click on Create Task.
- From the drop-down menu, select Generate from Raw Text.
- A pop-up window will appear for configuring your generation task.
Step 2.1: Configure Your Task
Name Your Generation Task
- In the first tab of the pop-up window, enter a name for your generation task.
Plugin Configuration
- Move to the next tab labeled Plugin Config.
- Select the generation model from the options available:
- Options include various Claude and OpenAI models, or a local model loaded in the Foundation tab.
- Specify the number of samples you want to generate.
Entering Context
- After configuring the plugin, navigate back to the Context tab.
- Paste the raw context you want to use to generate your datasets.
Step 3: Run the Task
- Once you have saved your evaluation task, click on the Queue button to start the generation process.
- When the generation is complete, the generated dataset will be visible under the Generated Tab in the Training Data section.

Step 4: Preview Your Data
- Go to the Generated in the Training Data section.
- Click on the dataset you generated to preview the data.
