Extractors
Extractors are functions that extract information from a webpage.
extract_text
Natural language description of the element to extract text from.
Whether to use vision to find the text. Enable this only if Simplex struggles to find the text with use_vision = False; otherwise it is not necessary.
Extracted text string(s) within the requested element. If the element contains multiple separate text chunks (e.g. the element description is "all financial headlines on a page"), these chunks will be separated by newline characters.
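Since multiple chunks come back as a single newline-separated string, callers will usually want to split the result before using it. Here is a minimal sketch of that step; `split_chunks` is a hypothetical helper name, and the sample string is made up for illustration rather than real output from Simplex.

```python
from typing import List


def split_chunks(extracted: str) -> List[str]:
    """Split a newline-separated extraction result into clean chunks."""
    # Strip surrounding whitespace and skip any empty lines.
    return [line.strip() for line in extracted.splitlines() if line.strip()]


# Made-up sample standing in for a multi-chunk extract_text result.
sample = "Markets rally on rate cut\nOil prices slip\n\nTech earnings beat forecasts"
headlines = split_chunks(sample)
```

Skipping blank lines is a defensive choice here; the documentation only states that chunks are separated by newline characters.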
Sample Usage
In this example, we’re building a script to purchase 3D models from CGTrader (a 3D design marketplace) for a synthetic data project. Since we want to use these assets to create synthetic data, we need to check whether the license allows AI use.
Here’s a sample product page (license highlighted in green):
We can extract the license text to check if it allows AI use.
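The check could look roughly like the sketch below. The Simplex client setup is not shown in this section, so the extractor is passed in as a plain callable and replaced here with a stand-in function. The names `license_allows_ai`, `check_license`, `fake_extract_text`, the phrase list, and the sample license string are all assumptions made for illustration, not part of the Simplex API or CGTrader's actual wording.

```python
from typing import Callable

# Hypothetical phrases that signal AI use is not permitted.
# Adjust these to the wording the marketplace actually uses.
DISALLOW_PHRASES = ("no ai", "ai training prohibited", "not for ai")


def license_allows_ai(license_text: str) -> bool:
    """Return False if the license text contains a disallowing phrase."""
    text = license_text.lower()
    return not any(phrase in text for phrase in DISALLOW_PHRASES)


def check_license(extract_text: Callable[[str], str]) -> bool:
    """Extract the license text via the given extractor and check it."""
    # The argument is the natural language description of the element.
    license_text = extract_text("license text on the product page")
    return license_allows_ai(license_text)


def fake_extract_text(element_description: str) -> str:
    """Stand-in for the real extractor; returns a made-up license string."""
    return "Royalty Free License, No AI"


allowed = check_license(fake_extract_text)
```

In a real script, the stand-in would be replaced by a call to extract_text on the loaded product page, and the purchase step would run only when the check passes. The substring match is deliberately naive; a stricter check may be needed for real license text.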