Web Page crawler
Tags: primo , pipes Last Updated: Oct 30, 2009 01:17
- Description
A collection of scripts and configuration files used to convert web page content to XML. Pipe to load into Primo is included. This is based on Swish-e spider.pl open source software. See README.txt file for more detail.
- Author: John Osborn
- Additional author(s):
- Institution: University of Iowa
- Year: 2009
- License: BSD style
- Short description: Use, modification and distribution of the code are permitted provided the copyright notice, list of conditions and disclaimer appear in all related material.
- Link to terms: [Detailed license terms]
- Skill required for using this code:
Advanced
State
Stable (in our environment)
Programming language
perl, SQL
Software requirements
perl, perl DBI
Screen captures


