![]() Replace with regular expression Function: Use a specific regular expression to replace the matched string/s in the extracted data with the string/s that you want. Replace Function: Replace the specific string/s in the extracted data with the new string/s that you want.Ģ. If you see the word "string" there, that means you can use the corresponding options to deal with a variety of character types in the data extracted, such as letters, words, sentences, numbers, spaces, symbols, and punctuation marks.ġ. You would see the word "string" in a lot of function instructions of Octoparse's data reformat options. If you replace a word with an empty string, colloquially, it is equal to saying that you delete the word. In other words, a string that contains no character is empty. A string can consist of no character as well. ![]() For example, " " (space) is a string "Octoparse" is a string and "Hello 2 *% World!" is also a string. In programming, a "string" basically refers to a collection of characters like letters, numerals, symbols, and punctuation marks. ![]() Select an operation to re-format your data Click on the "." icon and select "Clean data".Ĥ. To access these features in Octoparse, you should follow the 4 steps below:Ģ. How to refine the extracted data in Octoparse? No need to re-format the field after exporting the data into an excel file. Octoparse would scrape and refine it directly during the scraping process. If you have a desired data format for a certain field, you can use our "Clean Data" function to refine the field within Octoparse. Sharpen your skills and explore new ways to use Octoparse.ĭuring your web scraping project, you may want to clean the data fields while doing the web scraping. Octoparse offers 9 data cleaning options for turning the extracted data into the format you need. For the latest tutorials, visit our new self-service portal.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |