Jump to content

Úsáideoir:Guliolopez/VisitorStats

Ón Vicipéid, an chiclipéid shaor.
BETA
With apologies for using English

Sources of stats

[cuir in eagar | athraigh foinse]

There are several available sources for stats/analytics on the Irish Wikipedia project. The main ones of note are stats.grok.se and the "official" wikimedia dumps.

See Henrik's project at stats.grok.se. From here one can:

  • See a list of pages based on hit ranking. (NB: Unfortunately this hasn't been updated since December 2010)
  • See stats for individual pages. For example An Astráil or Príomhleathanach (NB: Unfortunately Irish isn't officially supported via the forms on the page, so you'll need to "manually" edit the date and/or article name in the URL. But you can see the hits for each page up until this current month).

Official Wikimedia stats

[cuir in eagar | athraigh foinse]

See official stats at stats.wikimedia.org. From here one can:

Historic ranking

[cuir in eagar | athraigh foinse]

Hit count traffic stats for the Irish language Wikipedia (December 2010)

[cuir in eagar | athraigh foinse]

Extract from stats.grok.se:

  1. 23142 Príomhleathanach
  2. 6295 AJAX
  3. 3914 Speisialta:RecentChanges
  4. 2753 Idirlíon
  5. 1907 Special:Statistics
  6. 1576 Vicipéid:Lárionad comhphobail
  7. 1370 Speisialta:Random
  8. 1305 Vicipéid:Cabhair
  9. 1282 Vicipéid:Polasaí príobháideachta
  10. 1269 Vicipéid:Maidir leis
  11. 1244 Vicipéid
  12. 1122 Vicipéid:Séanadh ginearálta
  13. 1077 Cabhair:Clár ábhair
  14. 927 Gaeilge
  15. 863 Speisialta:SpecialPages
  16. 853 Vicipéid:Cúrsaí reatha
  17. 755 Stáit Aontaithe Mheiriceá
  18. 753 Brollach
  19. 735 Arán
  20. 721 An Béarla
  21. 701 An Ghaeilge
  22. 700 1956

Hit count traffic stats for the Irish language Wikipedia (February 2009)

[cuir in eagar | athraigh foinse]

Ran this again as the stats.grok.se report is still stuck on last August.

  1. 12069 Príomhleathanach
  2. 7515 Speisialta:Search
  3. 6626 AJAX
  4. 5690 Speisialta:RecentChanges
  5. 3464 Speisialta:Random
  6. 1571 Gaeilge
  7. 1225 Cursaí_reatha
  8. 964 Vicipéid
  9. 875 Blag
  10. 813 Stáit_Aontaithe_Mheiriceá
  11. 689 Cabhair:Clár_ábhair
  12. 646 Béarla
  13. 591 Éire
  14. 559 XML
  15. 544 Comhbhrú sonraí
  16. 521 Idirlíon
  17. 517 Fraincis
  18. 514 Poblacht_na_hÉireann
  19. 463 Barack_Obama
  20. 430 An_Fhrainc
  21. 399 2005
  22. 391 Vítneam
  23. 390 Ceadúnas GNU do Dhoiciméadú Saor
  24. 382 Gréasán_Domhanda
  25. 379 Bogearraí_saora

Hit count traffic stats for the Irish language Wikipedia (October 2008)

[cuir in eagar | athraigh foinse]

I probably don't need to run these reports manually any more. As the same detail is now seemingly published at stats.grok.se.

See: stats.grok.se/ga/top

Hit count traffic stats for the Irish language Wikipedia (August 2008)

[cuir in eagar | athraigh foinse]

Raw results posted here. Top 25 below. No major differences from June. (See /ComparisonTable). Though why "22 Bealtaine" so popular, am unsure :)

  1. 11391 Príomhleathanach
  2. 6973 Speisialta:RecentChanges
  3. 5368 AJAX
  4. 4740 Speisialta:Search
  5. 2386 Speisialta:Random
  6. 1910 Cursaí_reatha
  7. 1425 Gaeilge
  8. 1408 Vicipéid
  9. 1228 Blag
  10. 1138 Stáit_Aontaithe_Mheiriceá
  11. 971 2006
  12. 970 XML
  13. 913 Cómhaoin_Wikimedia
  14. 839 Vicí
  15. 834 Béarla
  16. 833 Cabhair:Clár_ábhair
  17. 742 Speisialta:Upload
  18. 734 Idirlíon
  19. 719 MP3
  20. 666 Ceadúnas_GNU_do_Dhoiciméadú_Saor
  21. 664 2008
  22. 662 Fraincis
  23. 634 An_Fhrainc
  24. 615 22_Bealtaine
  25. 604 Podchraolachán
Trends/Zeitgeist

Total "hits" down from 436,186 in June to 395,350 in August. (Cause: Schools off?)

Selected big article hit increases:

  • Tbilisi up from 20 hits in June to 71 hits in August (Cause: South Ossetia conflict?)
  • MP3 up from 197 to 719 (Cause: Not a clue)
  • Inis_Iocht up from 19 to 77 (Cause: Not a clue)
  • Cork_City_F.C. up from 25 to 107 (Cause: News? [Financial difficulties])
  • Aleksandr_Solzhenitsyn up from 68 to 299 (Cause: Death. Front page article.)
  • SSL up from 102 to 449 (Cause: No idea)
  • Louis_XIII_na_Fraince up from 5 to 29 (Cause: No idea)
  • Deichniúr up from 16 to 107 (Cause: No idea)
  • Eoin_Shasana up from 27 to 218 (Cause: WAYY UP - no idea why...)
  • C up from 15 to 140 (Cause: ???)
  • Cathal_Ó_Sandair up from 2 to 19 (Cause: Was interwiki-ed from EN project in early Aug)
  • Josip_Broz_Tito up from 3 to 36 (Cause: Dunno)
  • Aosdána up from 16 to 198 (Cause: No clue. Increase all came in one day.)
  • Nollaig_Ó_Gadhra up from 21 to 306 (Cause: Death/news)
  • Zeeland up from 2 to 39 (Cause: Dunno)
  • Liam_Concar up from 4 to 213 (Cause: WAYY UP - no idea why...)

Hit count traffic stats for the Irish language Wikipedia (June 2008)

[cuir in eagar | athraigh foinse]
  1. 29214 Príomhleathanach
  2. 7523 Speisialta:RecentChanges
  3. 6044 Speisialta:Random
  4. 4989 AJAX
  5. 4651 Speisialta:Search
  6. 2115 Cursaí_reatha
  7. 1593 Gaeilge
  8. 1383 Vicipéid
  9. 1231 Blag
  10. 1048 Stáit_Aontaithe_Mheiriceá
  11. 1008 2006
  12. 950 Cabhair:Clár ábhair
  13. 931 XML
  14. 914 Béarla
  15. 907 Vicí
  16. 865 Ceadúnas GNU do Dhoiciméadú Saor
  17. 806 Ryanair
  18. 725 Speisialta:Upload
  19. 725 2005
  20. 691 Cómhaoin_Wikimedia
  21. 683 Éire
  22. 670 Podchraolachán
  23. 658 An_Fhrainc
  24. 643 Idirlíon
  25. 596 2008

I asked myself a few questions the other day:

  • "For all the effort that's being put into this project (by me and others), is it worthwhile? IE: Is anybody actually reading all this stuff?"
  • "If so, what are the most popular pages?"
  • "Is it possible to identify whether the Irish language Wikipedia project is any more popular than other Irish language sites? IE: Does this project attract more visitors/hits than (say) abair.ie, or beo.ie or whatever?"
  • "Could we find out? How?"

And I set myself a task to see how many of these questions I could answer.

There are several Wikimedia project tools which track (serverside) content volumetric stats on the various projects. Including the Irish one. Like stats.wikimedia.org which has tools which show the size of database, most editted articles, most prolific editors etc. Or this charting tool which shows number of new articles, number of edits, etc. (Over time)

But these only shows us how busy the active "editting community" is. It doesn't really show us how busy the normal "viewing community" is.

There are a few web based hit/visitor tracking tools associated to the Wikimedia project. That do give the visitor stats detail. But few/none that actively capture data on the GA project.

There is one exception: stats.grok.se. This has some nice reports - like the "Top visited pages" on the EN project. This looks like the kinda thing I was thinking about. But it doesn't (yet) have a list for the GA project.

It does allow you to view stats for individual pages by searching on EN and then changing the project identifier in the URL. IE: search stats.grok.se/en/200806/Éire and then change to stats.grok.se/ga/200806/Éire.

But the top list I wanted was missing. So I decided to create on myself. And I wrote a script to do it. Results are below.

Results (June 2008)

[cuir in eagar | athraigh foinse]

The top 25 "most visited" articles for the entire month of June 2008 are listed below. I've put the full results - sorted top down - as a subset of this page. (With apologies for how fadas/etc display. Is an aspect of the encoding system I used for the script.)

  1. 29214 Príomhleathanach
  2. 7523 Speisialta:RecentChanges
  3. 6044 Speisialta:Random
  4. 4989 AJAX
  5. 4651 Speisialta:Search
  6. 2115 Cursaí_reatha
  7. 1593 Gaeilge
  8. 1383 Vicipéid
  9. 1231 Blag
  10. 1048 Stáit_Aontaithe_Mheiriceá
  11. 1008 2006
  12. 950 Cabhair:Clár ábhair
  13. 931 XML
  14. 914 Béarla
  15. 907 Vicí
  16. 865 Ceadúnas GNU do Dhoiciméadú Saor
  17. 806 Ryanair
  18. 725 Speisialta:Upload
  19. 725 2005
  20. 691 Cómhaoin_Wikimedia
  21. 683 Éire
  22. 670 Podchraolachán
  23. 658 An_Fhrainc
  24. 643 Idirlíon
  25. 596 2008

Notes:

  • I included some of the "special" pages in the count, but did not run the script against the category, image, talk, user or other namespaces. But I might at some point.
  • The stats are the actual "hit counts". Not individual page displays. So, as an example, a redirect will be counted on its own. And not as part of the "main" article. IE: Aston Villa (a redirect) shows 7 hits, but Aston Villa F.C. shows 55. I didn't add these up to reflect "total hits for Aston Villa" or anything like that.)
  • Some of the results are interesting. Like the fact that the AJAX article is up near the top. Nearly five thousand hits? For such a short article? That is only linked from one other article? Traffic must be driven to it from another site. Possibly the interwiki link on the equivalent EN page. Or some Google or other search engine that ranks it highly in searches for "AJAX" for some reason.

A count of the total "hits" to the article namespace shows 436186 hits over 8966 articles. That's about 48.6 hits per "page" for the month of June. (Including redirects, stubs and other scraps). This is hardly a scientific measure of popularity. But it's a start.

I might (at some point) try and (a) improve this count accuracy. And (b) find a way to compare to other Irish lang sites.

I might also - every now again - run this script for updated numbers. (It won't be very often though. Because it takes hours to complete.) At some point (hopefully) the stats.grok.se TOP service will be run against the GA project. In a better way than I can/do.

I might also put a "quick stats" summary someplace also. (Total hits per month, top pages, etc).

Maybe.

Should we use these stats to identify (for example) which articles to extend/improve/etc? EG:

  1. Given that AJAX is (for some inexplicable reason) one of the most popular pages, should it be extended? (Does anyone know enough to extend it?) Same goes for XML, Cómhaoin Wikimedia and Blag.
  2. Given that Stáit Aontaithe Mheiriceá is also up there with the most visited, should we clean it up a bit? It's one of the most visited pages, and yet it has had a cleanup tag for a year or more. Same goes for Ryanair.
  3. I think someone already brought this up recently, but, given that the Cursaí reatha pages are so popular, should someone "adopt" them to ensure they are consolidated a bit, cleaned-up and then kept somewhat up to date? (Weekly?) Or should we just retire them as obsolete to Wikinews/etc?

Excellent work, Guliolopez, and certainly some interesting reading. We should definitely use this info to perhaps focus our efforts a little bit, to make sure that we are covering what people seem to be after, and that there's a decent standard in the articles that are most viewed. Maybe all of us who are interested could take certain articles and expand and/or clean them up? About the cúrsaí reatha - I think that it's a worthwhile section, but of course it's pointless or even counter-productive if it's not somehow up-to-date. The problem is probably simply the fact that nobody "owns" it, so nobody feels the need to look after it. --Antóin 17:57, 15 Iúil 2008 (UTC)