User talk:Thomas Wright Sulcer/sandbox11

From Citizendium
Revision as of 07:51, 20 March 2010 by imported>Thomas Wright Sulcer (added section)

Why do CZ articles sometimes fail to appear in Google searches?

Double-click the arrow button in the "What links here" column first (so the table is sorted numerically), THEN double-click the arrow button in the Google PageRank column. You'll see how "What links here" is highly correlated with PageRank.

For the method, see the notes below this wikitable.

{| class="wikitable sortable"
! CZ article title !! Date created (YYYYMM, e.g. 200707 = July 2007) !! CZ creator !! Article's Google rank (1 = best, 9999 = doesn't appear) !! Google PageRank (10/10 = best, 0/10 = totally obscure) !! # CZ articles in "What links here" (est.) !! Number of CZ revisions !! Approved (a) or draft (d) !! # CZ contributors (people) (est.) !! Notes
|-
| Terrorism || 200712 || D. Matt Innis || 1 || 3/10 || 500 || 8 || a || 3 ||
|-
| Lucille Ball || 200812 || Paul Wormer || 4 || 0/10 || 13 || 4 || d || 3 ||
|-
| Script kiddie || 201002 || Thomas Wright Sulcer || || 0/10 || 9 || || d || ||
|-
| Dana Delany || 201002 || Thomas Wright Sulcer || 8 || 0/10 || 1 || 2 || d || ||
|-
| Lady Gaga || 201002 || Thomas Wright Sulcer || 9999 || 0/10 || 20 || 13 || d || 4 ||
|-
| Bromance || 201003 || Thomas Wright Sulcer || 9999 || 0/10 || 13 || 6 || d || 1 ||
|-
| Albert Einstein || 200611 || Nancy Sculerati MD || 1 || 3/10 || 180 || 42 || d || 17 ||
|-
| Restructuring of the United States Army || 200807 || Howard C. Berkowitz || 1 || 2/10 || 175 || 54 || d || 5 ||
|-
| Solid harmonics || 200708 || Paul Wormer || 1 || 0/10 || 29 || 22 || d || 3 ||
|-
| Plane (geometry) || 200703 || John Roby Clayton || 1 || 0/10 || 35 || 16 || d || 9 ||
|-
| Gaussian elimination || 200905 || David E. Volk || 9999 || 0/10 || 9 || 8 || d || 2 || Note: this is what appeared first on Google: "CZ:Core Articles > Mathematics", not the CZ article; this came third: "Talk:Gaussian elimination"
|-
| Linear algebra || 200811 || Barry R. Smith || 1 || 2/10 || 35 || 2 || d || 1 ||
|-
| Cloud computing || 200807 || Ashwath Ravichandran || 4 || 2/10 || 50 || 117 || d || 6 || CZ really first; three ads for CC came up first
|-
| Horticulture || 200612 || Nancy Sculerati MD || 1 || 2/10 || 35 || 13 || d || 7 || "CZ: workgroup requests" ranks 2nd
|-
| Business || 200707 || Ian Johnson || 3 || 0/10 || 112 || 18 || d || 3 || "CZ:Business Workgroup" was 2nd; the business article was 3rd
|-
| Truth || 200805 || Larry Sanger || 1 || 2/10 || 35 || 11 || d || 5 || Jacques Derrida and Aesthetics came up #2 and #3
|-
| Anthropology || 200707 || Chris Day || 1 || 4/10 || 370 || 41 || d || 10 || CZ:anthropology workgroup came up #9
|-
| Applied social sciences || 200803 || Roger Lohmann || 1 || 2/10 || 28 || 15 || d || 6 || Anthro:catalogs 3/10 biblio 3/10 rel art 3/10
|-
| Henry Ford || 200704 || Richard Jensen || 1 || 3/10 || 45 || 41 || d || 10 || HF's external links and bibliography were #2 and #3
|-
| Nobel Prize || 200810 || Daniel Mietchen || 1 || 3/10 || 140 || 9 || d || 4 ||
|-
| Norway || 200705 || Kjetil Ree || 2 || 0/10 || 135 || 56 || d || 4 || The Norway "Gallery" came up 1st; a collection of images about Norway.
|-
| Great Britain || 200704 || Richard Jensen || 1 || 3/10 || 150 || 39 || d || 6 ||
|-
| United Kingdom || 200611 || Rilson Versuri || 2 || 0/10 || 1000 || 163 || d || 22 || What came first? "Parliament of the United Kingdom - encyclopedia article - Citizendium" Why? "Catalogs" came 3rd.
|-
| Scottish Parliament || 200801 || Gareth Leng || 1 || 2/10 || 39 || 12 || d || 4 ||
|-
| Royal Palace of Holyroodhouse || 200805 || Gareth Leng || 6 || 0/10 || 16 || 7 || d || 2 || First: Royal Mile article on CZ; second: Edinburgh (external links); third: Scottish Parliament; 4th = RP of H (related articles)
|-
| Arthur's Seat || 200801 || Gareth Leng || 1 || 0/10 || 19 || 6 || d || 4 || Scottish Parliament article #2; Royal Mile article #3; Arthur's Seat external links comes #7
|-
| Photosynthesis || 200808 || Anthony.Sebastian || 1 || 2/10 || 90 || 108 || d || 6 || Photosynthesis "Gallery" comes #2
|-
| Handyman || 201002 || Thomas Wright Sulcer || 9999 || 0/10 || 12 || 24 || d || 2 || CZ's Cottage industry comes 1st; Tool related articles comes #5
|-
| Guitar || 200810 || Kiryl Rusetski || 1 || 0/10 || 125 || 10 || d || 7 ||
|-
| Hierarchical clustering || 200911 || Daniel Mietchen || 3 || 0/10 || 10 || 7 || d || 2 || CZ article "Bioinformatics - encyclopedia article - Citizendium" comes 1st; the talk page of Hc comes about 8th.
|-
| Human || 200710 || Philippe Tusler || 1 || 4/10 || 380 || 9 || d || 6 ||
|-
| Chimpanzee || 200802 || Wilson Rondini IV || 1 || 3/10 || 44 || 35 || d || 7 ||
|-
| Money || 200706 || Anthony Argyriou || 3 || 3/10 || 40 || 12 || d || 5 || "Money supply" on CZ came up first, "Monetarism" came up 2nd
|-
| Money supply || 200811 || Nick Gardner || 1 || 3/10 || 50 || 11 || d || 2 ||
|}
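
The link between "What links here" and PageRank can also be checked numerically rather than by sorting columns. The following is only a rough sketch, not part of the original method: it copies the two columns from the table, leaves out Script kiddie because that row is only partly filled in, and computes a plain Pearson coefficient. The names `ROWS` and `pearson` are made up for this illustration.

```python
from math import sqrt

# (title, "What links here" estimate, PageRank out of 10), copied from the table.
# Script kiddie is left out because its row is only partly filled in.
ROWS = [
    ("Terrorism", 500, 3), ("Lucille Ball", 13, 0), ("Dana Delany", 1, 0),
    ("Lady Gaga", 20, 0), ("Bromance", 13, 0), ("Albert Einstein", 180, 3),
    ("Restructuring of the United States Army", 175, 2),
    ("Solid harmonics", 29, 0), ("Plane (geometry)", 35, 0),
    ("Gaussian elimination", 9, 0), ("Linear algebra", 35, 2),
    ("Cloud computing", 50, 2), ("Horticulture", 35, 2), ("Business", 112, 0),
    ("Truth", 35, 2), ("Anthropology", 370, 4),
    ("Applied social sciences", 28, 2), ("Henry Ford", 45, 3),
    ("Nobel Prize", 140, 3), ("Norway", 135, 0), ("Great Britain", 150, 3),
    ("United Kingdom", 1000, 0), ("Scottish Parliament", 39, 2),
    ("Royal Palace of Holyroodhouse", 16, 0), ("Arthur's Seat", 19, 0),
    ("Photosynthesis", 90, 2), ("Handyman", 12, 0), ("Guitar", 125, 0),
    ("Hierarchical clustering", 10, 0), ("Human", 380, 4),
    ("Chimpanzee", 44, 3), ("Money", 40, 3), ("Money supply", 50, 3),
]


def pearson(xs, ys):
    """Plain Pearson correlation coefficient; no external libraries needed."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


links = [row[1] for row in ROWS]
pagerank = [row[2] for row in ROWS]
r = pearson(links, pagerank)
print(f"{len(ROWS)} articles, Pearson r = {r:.2f}")
```

Pearson's coefficient is sensitive to single large values such as the United Kingdom row (1000 links, PageRank 0/10), so a rank-based coefficient would be a reasonable alternative to try.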

Notes

I looked at various CZ articles, created at different times by different people, and recorded several variables for each: number of revisions, number of contributors, and creation date (in a sortable format, so later dates are numerically higher). For each article, I typed the following into the Google search bar, putting the exact article name in quotes:

"Exact article name" citizendium

What rank did each one get? The rank is the entry number: if the article appeared on the first page in the fourth slot, it got a rank of "4". I counted subgroups too. This ranking comes from my own check on Google. If the article didn't appear after a page or two of results, I put 9999, meaning I didn't find it.
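
The search recipe and the 9999 convention can be written down in a few lines. This is a minimal sketch assuming the rank is noted by hand after looking at the results; `build_query`, `record_rank` and `NOT_FOUND` are hypothetical names, and nothing here talks to Google.

```python
from typing import Optional

NOT_FOUND = 9999  # value used in the table when the article did not appear


def build_query(article_title: str) -> str:
    """Exact article name in quotes, followed by the word citizendium."""
    return f'"{article_title}" citizendium'


def record_rank(position: Optional[int]) -> int:
    """Return the 1-based slot of the article, or 9999 if it was not found."""
    return NOT_FOUND if position is None else position


print(build_query("Lady Gaga"))
print(record_rank(4), record_rank(None))
```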

A second measure is Google's "PageRank": plugging an article title into the Google PageRank tool gives a number from 0 (lowest) to 10 (highest).

The third variable is "What links here": go to any CZ article and click the "What links here" link in the tools box. The number in the table is a rough estimate based on scrolling through the resulting pages. I think this is an important variable.

My working hypothesis is that CZ's Related Article subpages CONFUSE Google's crawlers, which have trouble figuring out what links to what. CZ's internal links are TOO COMPLEX, and CZ's web presence is harmed as a result. Date is a big variable: sort the table by date first, then by article Google rank. No article created in 2009 achieved a #1 spot EVEN WITH THE WORD "CITIZENDIUM" in the search bar.
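
The date-then-rank ordering is easy to reproduce outside the wikitable. The sketch below uses only a handful of rows copied from the table (it is a subset, chosen for brevity), sorts them by creation date and then by Google rank, and lists any article from 2009 onward that reached the #1 spot. The variable names are made up for this illustration.

```python
# (title, date created as YYYYMM, Google rank); a subset of the table rows
rows = [
    ("Lady Gaga", 201002, 9999),
    ("Albert Einstein", 200611, 1),
    ("Hierarchical clustering", 200911, 3),
    ("Anthropology", 200707, 1),
    ("Bromance", 201003, 9999),
    ("Money supply", 200811, 1),
    ("Gaussian elimination", 200905, 9999),
    ("Lucille Ball", 200812, 4),
    ("Dana Delany", 201002, 8),
    ("Horticulture", 200612, 1),
]

# Sort by date first, then by rank, as described above.
by_date_then_rank = sorted(rows, key=lambda row: (row[1], row[2]))

# Articles created in 2009 or later that still reached rank 1.
top_from_2009_on = [
    title for title, created, rank in rows if created >= 200901 and rank == 1
]

for title, created, rank in by_date_then_rank:
    print(created, rank, title)
print("Rank 1 from 2009 onward:", top_from_2009_on)
```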

One recommendation I think we should adopt: don't block Google's crawlers from user pages. The links from user pages to articles are a big source of strength for us. If some CZ users want their user pages "private" and unsearchable, then maybe a special arrangement can be made. This seems an easy, straightforward change that would help considerably.


Shell for adding more data:

| <!--Article [[]]---> || <!--YYYYMM created---> ||<!--CZ creator---> ||<!--Article Google rank---> ||0/10<!--Google's PageRank--->|| <!---#What Links Here---> ||<!--Number of CZ revisions---> ||d<!--Approved or Draft?---> ||<!--# CZ contributors (est)---> || <!--Notes---> |-
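
For anyone adding many rows, the shell can also be filled in by a small script. This is a sketch only: `wikitable_row` is a hypothetical helper, and it assumes the column order of the shell above, with the title wrapped in `[[ ]]` and the row separator `|-` placed on its own line.

```python
def wikitable_row(title, created, creator, rank, pagerank, links,
                  revisions, status, contributors, notes=""):
    """Build one wikitable row in the column order used by the shell."""
    cells = [
        f"[[{title}]]",      # article link
        created,             # YYYYMM
        creator,
        rank,                # 9999 = did not appear
        f"{pagerank}/10",    # Google PageRank
        links,               # "What links here" estimate
        revisions,
        status,              # "a" = approved, "d" = draft
        contributors,
        notes,
    ]
    return "| " + " || ".join(str(cell) for cell in cells) + "\n|-"


print(wikitable_row("Terrorism", 200712, "D. Matt Innis", 1, 3, 500, 8, "a", 3))
```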