Projects
- Content Extraction Via Text Density
-
We Proposed a DOM based content extraction approach via text density. This method improves the quality of
structural content extraction of web pages and retains the original structural information in the web page
cleaning process.