Importing HTML Files
The maximum size an HTML file can be when importing or uploading, by default, is 4194304 bytes (4mb). This setting can be changed by using the property:
das.import.whole-text-file.max-size=<size in mb>
Set this for a specific job in the Configuration settings by adding it to the Custom Properties field.
Set this globally by adding the property to the file <Datameer path>/das-common.properties
. After this property is added, Datameer doesn't need to be restarted.
Importing HTML Files
- Create a new Import Job or File Upload.
- Choose your connector where the HTML file is stored and then choose HTML File Type.
Select the HTML file or folder to import.
Adjust the HTML parse option as needed:1 Parse the HTML file with no changes 2 Parse the HTML file after removing the metadata information 3 Parse the HTML file after removing the metadata information and all HTML tags - Review the sample data.
- Save and run the new import job or file upload.