Web scraping technique is a sub-discipline of web mining technology. Web Mining stays at the crossroads of Information Retrieval, Information Extraction and Data Mining. Exploring better these techniques is extremely necessary to cope with the amount of available data in the Information Overload Era. With the Web being oriented towards the importance of semantics and integration of information, these areas of study become very important to address the new future trends.
Information Retrieval is a sub-part of artificial intelligence. IE mainly focus on to extract valuable data out of unstructured data.
Information extraction is a necessary pre-processing step to structure data before a statistical data mining algorithm can build knowledge from it. To extract a useful and important data from a retrieved information different techniques are used. In this paper we are focus on the web scraping techniques and discuss some of the tools which are available for web scraping.