Breakdown of log entries by type
I looked at the catalina.out log records for 2008-09-21. These logs are in the file catalina.out_2008-09-21
(because of the way the files are backed up, the log entries only extend to 11:30 PM). From these entries, I made a list of log entries by type:
It is worth noting that of the 62K hits, only 20 came from "SimpleSearch". That is, only 20 users went to our search engine; the rest searched through Google or are robots.
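A breakdown like this can be produced with a short script. The following is only a minimal sketch: it assumes each catalina.out entry has the layout "timestamp LEVEL class @ user:session_id=...:ip_addr=...:action:details", which is not confirmed on this page, and the names LOG_LINE and count_by_type are hypothetical. Lines that do not fit the assumed layout are skipped rather than counted.

```python
import re
from collections import Counter

# ASSUMPTION: entries look like
#   2008-09-21 00:00:01,123 INFO  some.Class @ user:session_id=X:ip_addr=Y:action:details
# Adjust the pattern if the real catalina.out layout differs.
LOG_LINE = re.compile(
    r"^\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3}\s+\w+\s+\S+ @ "
    r"[^:]*:session_id=[^:]*:ip_addr=[^:]*:(?P<action>[^:\s]+)"
)


def count_by_type(lines):
    """Count log entries per action type; skip lines that do not match."""
    counts = Counter()
    for line in lines:
        match = LOG_LINE.match(line)
        if match:
            counts[match.group("action")] += 1
    return counts
```

To run it against the day's file, pass the open file object, for example count_by_type(open("catalina.out_2008-09-21")), and print counts.most_common() to see the types ordered by frequency.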
Log types that are required for stats
Jason Roy and I agree that the following log types are needed for stats.
||Number in Log||Found Apache Match||Apache needs SQL||
The "Apache needs SQL" column indicates whether information from the DSpace SQL database is required to map the Apache logs to the DSpace catalina logs. Also, the term "view_bitstream" corresponds to a download.
How apache logs map to catalina logs
To take care of some issues in the catalina logs, I am going to use Apache logs. Here are examples
of the log entries for both Apache and catalina for all of the critical log types given in the table above. There are also catalina examples for almost all the types.
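Before Apache entries can be compared with catalina entries, each Apache line has to be split into fields. The following is a rough sketch of that first step only. It assumes the Apache logs use the standard Common Log Format (or the combined format, which starts with the same fields); the function name parse_apache_line and the choice of fields to keep are hypothetical. How a parsed request is then matched to a catalina log type, with or without the SQL database, is not shown.

```python
import re
from datetime import datetime

# ASSUMPTION: Apache Common Log Format, e.g.
#   host ident user [21/Sep/2008:00:00:01 -0400] "GET /path HTTP/1.1" 200 1234
APACHE_LINE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+)[^"]*" (?P<status>\d{3}) (?P<size>\S+)'
)


def parse_apache_line(line):
    """Return the fields of one access-log line as a dict, or None if it does not match."""
    match = APACHE_LINE.match(line)
    if match is None:
        return None
    return {
        "ip": match.group("ip"),
        # Timestamp with UTC offset, parsed into a timezone-aware datetime.
        "time": datetime.strptime(match.group("time"), "%d/%b/%Y:%H:%M:%S %z"),
        "method": match.group("method"),
        "path": match.group("path"),
        "status": int(match.group("status")),
    }
```

The IP address, timestamp, and request path are kept because they seem the most likely fields to line up with a catalina entry; that choice is a guess and may need revising once real entries are compared side by side.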