We require a script that can process large Apache logfiles (> 1GB) and place them in a database. Because this script is being used for Search Engine Optimization purposes, we do not need all data put into the database. Thus, while parsing, data needs to be dropped, if the referrer is not a search engine. Also, a specific log entry may be parsed more than once, so the script needs to hash the line (based on time, referrer, ip etc) in order to make sure the loglines in database are unique. Another script is needed that can now generate reports based on the data in the database. We need to have a report that details how many unique ip addresses have been referred from search engines, and what search words were used. These details will need to be presented for a certain month, year, or all-time. Also, the script needs to be able to display all details from a specific search engine.
## Deliverables
1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.
2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):
a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.
b) For all others including desktop software or software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.
3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).
## Platform
PHP, MySQL, Apache Common Log Format, Linux Red Hat.