It takes a web server log file as input and outputs statistics in the form of different reports.
Its design principles differ considerably from those of other software of the same type.
Graph generation can be combined with Graphviz.
Here are some key features of "Visitors":
· No installation required. It can process up to 150,000 log lines per second on fast computers (about 20 MB/s at the average line length of the author's log files).
· Designed to be run from the command line; it outputs HTML and text reports. The text report can be piped to less to check web stats over SSH.
· Support for real-time statistics with the Visitors Stream Mode, introduced in version 0.3.
· There is no need to specify the log format. It works out of the box with Apache and most other web servers that use a standard log format.
· It is a portable C program and can be compiled on many different systems.
· The generated HTML report contains no images or external CSS; it is self-contained, so you can send it to users by email.
· Requested pages.
· Requested images.
· Referrers by hits and age.
· Unique visitors for each day.
· Page views per visit.
· Pages accessed by the Google crawler (and the date of Google's last access to every page).
· Percentage of visits originating from Google searches, for each day.
· Users' navigation patterns (web trails).
· Keyphrases used in Google searches.
· User agents.
· Weekday and hour distributions of accesses.
· Weekdays/hours combined two-dimensional map.
· Month/year combined two-dimensional map.
· Visual path analysis with Graphviz.
· Operating systems, browsers and domains popularity.
· 404 errors.
· Under Win32, the user needs MinGW and MSYS.
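Several of the reports listed above come down to counting over parsed log lines. As a rough illustration only (this is Python, not the C code Visitors itself is written in), the sketch below assumes lines in the NCSA Common/Combined Log Format and builds the weekday/hour map and a per-day unique visitor count. The names parse_line, weekday_hour_map and unique_hosts_per_day are hypothetical, and treating each distinct client host as one visitor is a simplifying assumption; Visitors may identify visitors differently.

```python
import re
from collections import Counter, defaultdict
from datetime import datetime

# Assumed layout (NCSA Common/Combined Log Format prefix):
# host ident user [timestamp] "request" status size ...
LOG_RE = re.compile(
    r'^(?P<host>\S+) \S+ \S+ \[(?P<ts>[^\]]+)\] '
    r'"(?P<request>[^"]*)" (?P<status>\d{3}) \S+'
)


def parse_line(line):
    """Return (host, datetime, status), or None if the line does not parse."""
    m = LOG_RE.match(line)
    if m is None:
        return None
    try:
        when = datetime.strptime(m.group("ts"), "%d/%b/%Y:%H:%M:%S %z")
    except ValueError:
        return None
    return m.group("host"), when, int(m.group("status"))


def weekday_hour_map(lines):
    """Count hits per (weekday, hour) cell; weekday 0 is Monday."""
    counts = Counter()
    for line in lines:
        parsed = parse_line(line)
        if parsed is None:
            continue  # skip lines that are not in the assumed format
        _, when, _ = parsed
        counts[(when.weekday(), when.hour)] += 1
    return counts


def unique_hosts_per_day(lines):
    """Count distinct client hosts per calendar day (assumption: host == visitor)."""
    seen = defaultdict(set)
    for line in lines:
        parsed = parse_line(line)
        if parsed is None:
            continue
        host, when, _ = parsed
        seen[when.date()].add(host)
    return {day: len(hosts) for day, hosts in seen.items()}
```

Both functions accept any iterable of strings, so an open log file can be passed directly. Lines that do not match the assumed format are skipped rather than reported.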
What's New in This Release:
Changes for the 0.7 version include:
· --grep and --exclude options to process only lines matching, or not matching, the specified patterns. Patterns can include wildcard characters such as * and ?, character ranges, and so on; this is called glob-style matching (see the online documentation for more information).
· Multiple grep/exclude patterns can be used at the same time and are processed sequentially.
· Added --ignore-404 so that 404 errors are not processed like other log lines when generating statistics. This is useful for sites that receive many requests producing 404 error codes.
· Some other minor fixes.
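The filtering described in this changelog can be approximated in a few lines. The following is a minimal Python sketch, not Visitors' own implementation: it uses the standard fnmatch module for glob-style matching and assumes that each pattern is applied in order and that a line must pass every filter to be kept. The helper names (passes_filters, is_404, select_lines) are hypothetical, and the status check assumes the status code directly follows the quoted request, as in the Common Log Format.

```python
import re
from fnmatch import fnmatchcase

# Assumption: the HTTP status code follows the closing quote of the request.
STATUS_RE = re.compile(r'" (\d{3}) ')


def passes_filters(line, filters):
    """Apply (kind, pattern) filters in order; kind is "grep" or "exclude".

    A "grep" filter rejects lines that do not match its glob pattern,
    an "exclude" filter rejects lines that do match.
    """
    for kind, pattern in filters:
        matched = fnmatchcase(line, pattern)
        if kind == "grep" and not matched:
            return False
        if kind == "exclude" and matched:
            return False
    return True


def is_404(line):
    """Return True if the log line carries a 404 status code."""
    m = STATUS_RE.search(line)
    return m is not None and m.group(1) == "404"


def select_lines(lines, filters=(), ignore_404=False):
    """Return the lines that survive the filters and the optional 404 check."""
    kept = []
    for line in lines:
        if ignore_404 and is_404(line):
            continue
        if passes_filters(line, filters):
            kept.append(line)
    return kept
```

In this sketch a pattern is matched against the whole line, so a pattern meant to match anywhere needs a leading and trailing *, for example "*.png *". Whether Visitors anchors patterns the same way is not stated here; the online documentation is the authority on that.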