Pline Product Docs
  • 🚀Introduction
    • Key Features
    • Key Terminologies
  • Pline Browser Extension
  • Signing Up for Pline
    • Signing Up for Pline
  • Automated Data Extraction Mode
    • How to Build an Automation Workflow
    • How to Run an Automation Workflow
    • Inner Page Data Extraction
    • Limit Record Extraction
    • Wait Timer for Automation
    • Add to Existing Dataset
    • Multi Type Data Selector
  • Browse and Capture
    • How to Build a Browse & Capture Workflow
    • How to Run a Custom Browse and Capture Workflow
    • Multi-Tab Data Extraction
    • Add Alternate Selectors
  • Workflows
    • Custom Workflows
    • Prebuilt Workflows
    • Workflow Status
  • Pline Platform Navigation
    • Accessing Datasets
    • Editing Datasets
    • Filtering Datasets
    • Downloading Datasets
    • Tracking Workflows
      • Delete Workflow
      • View Workflow History
    • Updating Profile
    • Credit Usage
    • Field Name Recommendations
  • Team Collaboration
    • Inviting a New Team Member
    • Managing Team Members
    • Roles in Team Colloboration
  • Scheduling Workflows
    • Creating a New Schedule
    • Managing Scheduled Workflows
    • Viewing Scheduled Run Details
    • Workflow Schedule Status
    • Proof of Record
  • Release Notes
    • Pline v 1.10.12
    • Pline v 1.10.11
    • Pline v 1.10.10
    • Pline v 1.10.9
    • Pline v 1.10.8
    • Pline v 1.10.7
    • Pline v 1.10.6
    • Pline v 1.10.5
    • Pline v 1.10.4
    • Pline v 1.10.3
    • Pline v 1.10.2
  • Platform Domain Change & Extension Sync
Powered by GitBook
On this page
  • Launch the Pline Extension
  • Building an Automation Workflow
  • Step 1: Select Page Type
  • Step 2: Grouping Similar Items
  • Step 3: Select Data Fields
  • Step 4. Select Pagination Type
  • Step 5: Save the Workflow
  1. Automated Data Extraction Mode

How to Build an Automation Workflow

Learn How to Build Pline's Automated Data Workflow to Gather Large Datasets Effortlessly

PreviousAutomated Data Extraction ModeNextHow to Run an Automation Workflow

Last updated 2 months ago

Pline's Automated Data Extraction mode helps you streamline your data collection process. It quickly gathers large amounts of data automatically without needing manual interventions. Seamlessly extract hundreds of records within a few minutes with Pline—no coding required.

A workflow is a custom template you create for extracting data from web pages. 
It defines the structure for the data extraction process.

Launch the Pline Extension

  • Go to the website where you want to extract data.

  • Open Pline Extension.

  • Select “Automated Data Extraction Mode” to get started.

Building an Automation Workflow

In this guide, we'll extract men's shoes data from Amazon.com

  • Click on "Build Workflow" to get started.

Step 1: Select Page Type

Webpages present data in various layouts, such as Vertical Lists, Grid Layouts, or Horizontal Lists.

  • Choose the layout that matches the page structure (e.g., Grid List).

Step 2: Grouping Similar Items

Grouping items enables Pline to recognize and select similar data points across the page, such as product names, prices, or ratings, while ignoring unrelated elements.

This focused approach reduces the chance of capturing irrelevant information, making the extraction more accurate and efficient.

By narrowing down the selection to a specific area, grouping helps automate data extraction across similar items on the page, saving time and enhancing data quality, especially when working with large lists of comparable entries like products or services.

  • Select two similar items from the page (e.g., product image).

Pline automatically identifies and groups similar data points, ensuring accurate and focused extraction.

Step 3: Select Data Fields

In this step, we’ll specify the data fields to extract from the Amazon page.

Pline supports seamless handling of multiple data formats within a single workflow for a more streamlined extraction experience.

Select the required fields directly from the page and choose your desired data type. For example, enter a field name like “Product_name,” then open the Extract drop-down menu and select the type, such as “Text.”

1

Identify and click the data points you want to extract (e.g., product name, price, or URL).

2

Add a field name (e.g., "Product_name").

3

Select the data type from the drop-down menu (e.g., "Text").

If multiple data types (e.g., Link) are required for the selected attribute, follow these steps:

  1. Click on the checkbox before Get Link and provide the appropriate field name.

  2. Click the check mark (✓) to proceed.

Once you have selected multiple data types for required attributes with appropriate field names, select Save to continue.

4

Preview the sample data. If correct, save the selection.

Repeat this process for all required fields.

  • When done, click “Next” . You can choose as many data fields from the same page as you want for web scraping. Once you have selected all the required data fields, click Next.

Step 4. Select Pagination Type

Now, we'll choose the pagination type, which varies depending on the pagination style of the target website.

  • As Amazon uses the Next pagination, we'll select the "Next Button" on the Pline extension panel.

  • Tag the "Next Button" or configure it accordingly to complete this step.

Your workflow is ready for data extraction!

  • Click "View sample data" to see a small sample of captured data.

Step 5: Save the Workflow

Preview the sample data extracted with this workflow. If it meets your expectations, add a workflow name and click "Save workflow" to save your Automation workflow.

  • Use the workflow immediately or save it for future use.

Saved workflows are stored under the My Workflow tab.

For a detailed guide on selecting multiple data types within a workflow and the supported data types, refer to .

You can skip this step for now. Or, if you want to learn inner page extraction, click .

Multi-Type Data Selector
here
Pline panel open on ecommerce like  Amazon with "Build workflow" button highlighted for starting data extraction.
Pline prompts user to select page layout type with "Grid" option selected.
Pline instructs user to group similar items on Amazon by selecting two product cards.
Pline panel displays field name input for extracting product name from Amazon listings.
Pline shows selected data fields like product name, ratings, and brand from Amazon.
Pline asks user to choose a pagination method, with “Next button” option selected.
User tags the "Next" pagination button on Amazon to allow Pline to navigate pages.
Pline prompts user to extract fields from detailed product page linked by product name.
 Pline shows a preview of extracted sample data and workflow name before saving.
Confirmation message that Amazon shoes workflow is saved with option to use it now or later.