Misplaced Pages

User:Killiondude/stats: Difference between revisions

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
< User:Killiondude Browse history interactively← Previous editNext edit →Content deleted Content addedVisualWikitext
Revision as of 22:24, 14 January 2012 editSelery (talk | contribs)1,132 edits How do I see stats for this month? A link doesn't show up!: /latest/← Previous edit Revision as of 00:11, 9 March 2012 edit undoKilliondude (talk | contribs)Extended confirmed users28,867 edits Updating page contentsNext edit →
Line 22: Line 22:
===How often are the stats updated?=== ===How often are the stats updated?===
Once per day, usually soon after 0:00 ]. Once per day, usually soon after 0:00 ].
<!--

===How do I view total views for longer than a month?=== ===How do I view total views for longer than a month?===
The default view is set for showing data for an article within the current month. To view an entire year's data, simply remove the month part from the URL ( http://stats.grok.se/en/201004/Tree → http://stats.grok.se/en/2010/Tree ) The default view is set for showing data for an article within the current month. To view an entire year's data, simply remove the month part from the URL ( http://stats.grok.se/en/201004/Tree → http://stats.grok.se/en/2010/Tree )


As of December 2010, the yearly data didn't aggregate properly. I (Killiondude) sent an email to Henrik about this on May 31, 2011 when it was . As of December 2010, the yearly data didn't aggregate properly. I (Killiondude) sent an email to Henrik about this on May 31, 2011 when it was .
-->


===Where is the data previous to October 2009 located?=== ===Where is the data previous to October 2009 located?===
Line 32: Line 33:


===Is the pageview data available in any other data format?=== ===Is the pageview data available in any other data format?===
Yes. You can see the original (source) data at http://dammit.lt/wikistats/. You can see ] formatted data by prepending <tt>/json/</tt> to the URL like so: http://stats.grok.se/json/en/200910/Michael_Jackson. Yes. You can see the original (source) data at http://dumps.wikimedia.org/other/pagecounts-raw/ (] of movement from ]' personal server to Wikimedia's database dumps). You can see ] formatted data by prepending <tt>/json/</tt> to the URL like so: http://stats.grok.se/json/en/200910/Michael_Jackson.
Raw data is also available at archive.org (see ]) and (]). Raw data is also available at archive.org (see ]) (]).


===What do these columns represent in dammit.lt's data sets?=== ===What do these columns represent in the original data sets?===
The format of these files are as follows: <project> <page name> <access count number> <transfer size in bytes> The format of these files are as follows: <project> <page name> <access count number> <transfer size in bytes>


Line 57: Line 58:
* ]: articles with biggest view increases (only Misplaced Pages) * ]: articles with biggest view increases (only Misplaced Pages)
* Last 30 days, also works when stats.grosk.se is down. For Misplaced Pages English. For Misplaced Pages German: * Last 30 days, also works when stats.grosk.se is down. For Misplaced Pages English. For Misplaced Pages German:
* provided by ] used for third party programs or analyzing (Henrik's source); see also ] * used for third party programs or analyzing (Henrik's source); see also ]

Revision as of 00:11, 9 March 2012

This page is a work in progress

Frequently Asked QuestionsThis page serves to document frequently asked questions regarding Henrik's Misplaced Pages article traffic statistics tool.

Is it case sensitive?

No.

Are redirects included in the data for a specific article?

No. One would need to look up each redirect's hit statistics.

How can I find out the top viewed pages for any given project?

View a statistics page for any article on the desired project. Then change the URL manually to replace the date and article name with the term top. Example: http://stats.grok.se/en/200912/Special:Searchhttp://stats.grok.se/en/top

Note that this information is not updated on a regular schedule. It is performed by Henrik (at least somewhat) manually.

How do I see stats for this month? A link doesn't show up!

You can change the URL manually to this month's numerical code (January = 01, February = 02, and so on). An example would be http://stats.grok.se/en/201004/Tree where 2010 is the year and 04 is the month of April.

How do I see stats for the past 30 days?

Use the format http://stats.grok.se/en/latest/Tree which will always be the current previous 30 days.

How often are the stats updated?

Once per day, usually soon after 0:00 UTC.

Where is the data previous to October 2009 located?

There is a set uploaded to the Internet Archive located here.

Is the pageview data available in any other data format?

Yes. You can see the original (source) data at http://dumps.wikimedia.org/other/pagecounts-raw/ (announcement of movement from domas' personal server to Wikimedia's database dumps). You can see JSON formatted data by prepending /json/ to the URL like so: http://stats.grok.se/json/en/200910/Michael_Jackson. Raw data is also available at archive.org (see this list) (announcement).

What do these columns represent in the original data sets?

The format of these files are as follows: <project> <page name> <access count number> <transfer size in bytes>

Are sisterprojects included?

Starting with 20080517-100000 other projects than Misplaced Pages are also included in the raw data, but not visibly in the interface; except for Meta (e.g. http://stats.grok.se/meta/201005/Main_Page), Commons (e.g. http://stats.grok.se/commons/201005/Main_Page) and sv.source (e.g. http://stats.grok.se/sv.s/200904/Huvudsida) the link to the page you're seeing the statistics of is broken. The code to be used in the url is the same as in raw data, so <lang>., www.w for Misplaced Pages portal, incubator.m, species.m.

Note: More detailed information about the format of URLs available here: http://www.archive.org/details/wikipedia_visitor_stats_200712

Why are figures so low?

«A significant percentage (about a third) of pageviews weren't being logged due to packet loss on the aggregating server.» ( Pageview data lost to packet loss) The problem possibly started on November 2009 and has been corrected in late July 2010.

In December of 2011 there was also loss of data on Wikimedia's part.

See also