Skip to main content

Generate Data from Raw Text

This page explains how to generate data from raw text using Transformer Lab.

Docs Upload Gif

Step 1: Download the Generate From Raw Text Plugin

  • Go to the Plugins Tab.
  • Use the filter by type generator to narrow down the list.
  • Download the Generate From Raw Text Plugin.

Step 2: Create a Generation Task

  • Navigate to the Generator Tab.
  • Click on Create Task.
  • From the drop-down menu, select Generate from Raw Text.
  • A pop-up window will appear for configuring your generation task.

Step 2.1: Configure Your Task

Name Your Generation Task

  • In the first tab of the pop-up window, enter a name for your generation task.

Plugin Configuration

  • Move to the next tab labeled Plugin Config.
  • Select the generation model from the options available:
    • Options include various Claude and OpenAI models, or a local model loaded in the Foundation tab.
  • Specify the number of samples you want to generate.

Entering Context

  • After configuring the plugin, navigate back to the Context tab.
  • Paste the raw context you want to use to generate your datasets.

Step 3: Run the Task

  • Once you have saved your evaluation task, click on the Queue button to start the generation process.
  • When the generation is complete, the generated dataset will be visible under the Generated Tab in the Training Data section.
Docs Upload Gif

Step 4: Preview Your Data

  • Go to the Generated in the Training Data section.
  • Click on the dataset you generated to preview the data.
Docs Upload Gif