GoLess
  • Basics
    • Recording Actions
    • Adding workflow manually
    • Tutorial: Scraping data
  • Workflow
    • Customization
    • Blocks
    • Global Data
    • Variables
    • Element Selector
    • Expressions
    • Workflow Table
    • Looping
    • Google Drive
  • Blocks
    • General
      • Note Block
      • Trigger Block
      • Execute Workflow Block
      • Delay Block
      • Export Data Block
      • HTTP Request Block
      • Blocks Group
      • Clipboard Block
      • Wait Connections Block
      • Notification Block
      • Workflow State
      • Parameter Prompt Block
    • Browser
      • Active Tab Block
      • New Tab Block
      • Switch Tab Block
      • New Window Block
      • Proxy Block
      • Go Back Block
      • Go Forward Block
      • Close Tab/Window Block
      • Take Screenshot Block
      • Browser Event Block
      • Handle Download Block
      • Handle Dialog Block
      • Reload Tab Block
      • Get Tab URL Block
      • Cookie Block
    • Web Interaction
      • Click Element Block
      • Get Text Block
      • Scroll Element Block
      • Link Block
      • Attribute Value Block
      • Forms Block
      • Javascript Code Block
      • Trigger Event Block
      • Switch Frame Block
      • Upload File Block
      • Hover Element Block
      • Save Assets Block
      • Press Key Block
      • Create Element Block
    • Control Flow
      • Repeat Task Block
      • Conditions Block
      • Element Exists Block
      • While Loop Block
      • Loop Data Block
      • Loop Elements Block
      • Loop Breakpoint
    • Data
      • Insert Data Block
      • Delete Data Block
      • Get Log Data Block
      • Slice Variable Block
      • Increase Variable Block
      • RegEx Variable Block
      • Data Mapping Block
      • Sort Data Block
    • Оnline Services
      • Google Sheets Block
    • Premium
      • ChatGPT Block
      • Captcha Block
  • REFERENCE
    • Logs
    • Schedule
    • Storage
    • Packages
    • Condition Builder
    • Workflow Common Errors
    • JavaScript Execution Context
Powered by GitBook
On this page
  1. Basics

Tutorial: Scraping data

Getting basic data from websites.

PreviousAdding workflow manuallyNextWorkflow

Last updated 1 year ago

In this tutorial, we're going to scrape data from this webpage: .

  1. To get started, access the extension, open the dashboard and click on "New Workflow".

  1. Your workflow will start with a trigger. A is an action defining when and under what conditions your automation should execute. By default, the trigger is set to "Manual" mode, meaning the automation will only run when you initiate it yourself.

  1. You have the option to select a different trigger or add multiple triggers for your automation. These could include intervals, schedules, context menus (right-click on web pages), specific dates, on browser start-up, or keyboard shortcuts.

  • To select this .post class, visit the demo.goless.com site and enable Element Selector in the extension.

  • We then hover our mouse over the desired block to obtain information about the required class.

And with that, our workflow setup is complete. Upon running, the automation will save the data from the website in a csv file.

Next, add the "" block. This means that upon automation initiation, a new tab will open with the address you specify, which in this case will be the webpage for scraping: https://demo.goless.com/.

Next, we add a "" block. This block will iterate over the elements on the page as a list. We need to capture all blocks with the class .post. Thus, we specify the CSS Selector as .post.

Next, within the .post element, we need to get the title. To do this, we add a "" block to our workflow. In the settings, we specify: {{ loopData@items }} .title - here we instruct the script to take the elements from the previous block (items in our case, which should be pre-defined as Loop ID) and search within it for the CSS class .title.

If you need to capture several fields and export them, you will need to set up a . Select the "Insert to table" checkbox and choose into which field of the table the data should be added. Click on the table icon in the top-right corner beforehand to create a table format.

To terminate the loop, add a and specify the id of the Loop elements, which is items in our case.

The final block is the data export. You need to add an "" block to download the gathered data upon completion.

New tab
Loop elements
Get text
table
Loop Breakpoint
Export data
https://demo.goless.com/
trigger