Are surface temperature records reliable?
The skeptic argument...
Temp record is unreliable
"We found [U.S. weather] stations located next to the exhaust fans of air conditioning units, surrounded by asphalt parking lots and roads, on blistering-hot rooftops, and near sidewalks and buildings that absorb and radiate heat. We found 68 stations located at wastewater treatment plants, where the process of waste digestion causes temperatures to be higher than in surrounding areas.
In fact, we found that 89 percent of the stations – nearly 9 of every 10 – fail to meet the National Weather Service’s own siting requirements that stations must be 30 meters (about 100 feet) or more away from an artificial heating or radiating/reflecting heat source." (Watts 2009)
What the science says...
| Select a level... |
Basic
|
Intermediate
|
Advanced
| ||||
|
The warming trend is the same in rural and urban areas, measured by thermometers and satellites, and by natural thermometers. |
|||||||
Surveys of weather stations in the USA have indicated that some of them are not sited as well as they could be. This calls into question the quality of their readings.
However, when processing their data, the organisations which collect the readings take into account any local heating or cooling effects, such as might be caused by a weather station being located near buildings or large areas of tarmac. This is done, for instance, by weighting (adjusting) readings after comparing them against those from more rural weather stations nearby.
More importantly, for the purpose of establishing a temperature trend, the relative level of single readings is less important than whether the pattern of all readings from all stations taken together is increasing, decreasing or staying the same from year to year. Furthermore, since this question was first raised, research has established that any error that can be attributed to poor siting of weather stations is not enough to produce a significant variation in the overall warming trend being observed.
It's also vital to realise that warnings of a warming trend -- and hence Climate Change -- are not based simply on ground level temperature records. Other completely independent temperature data compiled from weather balloons, satellite measurements, and from sea and ocean temperature records, also tell a remarkably similar warming story.
For example, a study by Anderson et al. (2012) created a new global surface temperature record reconstruction using 173 records with some type of physical or biological link to global surface temperatures (corals, ice cores, speleothems, lake and ocean sediments, and historical documents). The study compared their reconstruction to the instrumental temperature record and found a strong correlation between the two:
Temperature reconstruction based on natural physical and biological measurements (Paleo, solid) and the instrumental temperature record (MLOST, dashed) relative to 1901-2000. The range of the paleo trends index values is coincidentally nearly the same as the GST although the quantities are different (index values versus temperature anomalies °C).
Confidence in climate science depends on the correlation of many sets of these data from many different sources in order to produce conclusive evidence of a global trend.
Last updated on 13 January 2013 by dana1981. View Archives

Arguments

































Basic
Intermediate
Advanced







Peter, some of the links seems to be broken? Anyway, does your first chart represent published results or just your own analysis? If you are interested in Arctic surface station records have a look at Bekryaev 2010 which uses data from 441 high latitude and Arctic surface stations.
Like Berényi Péter, I also don't have a lot of time right now, being about to leave for vacation in a few days and having far too much to do. But I thought it would be worth putting up a quick example to illustrate the necessity of using some kind of spatial weighting when analyzing spatially heterogeneous temperature data.
Since BP uses Canada as his example, I'll do the same. He mentions a useful data source, the National Climate Data and Information Archive of Environment Canada. I'll use the same data source.
Since I want to get this out quickly, I'm just using monthly mean temperature data from July, and as another shortcut I'll just look at every 5 years (i.e., 2010, 2005, 2000, 1995, ...) I picked July because it's the most recent complete month and 5-year intervals for no particular reason. Maybe sometime later I can expand this to look at the complete monthly data set. In any case, using just one month per 5 year interval will make this analysis more "noisy" than it would otherwise be, but that's OK.
I then identified all stations with data in all years, and whose name and geographic coordinates were exactly the same in all years. There's just over 150 of them:
Note, first, that the stations aren't distributed uniformly.
Note, second, that the trends differ greatly in different regions.
In particular, note that there are a large number of stations showing cooling in inland southwestern Canada. There are also a lot of stations showing warming across eastern and northern Canada.
(This is an Albers conical equal-area projection, so the apparent density of stations is proportional to their actual density on the landscape).
If you calculate the trend for each station, and then just take the overall non-spatial average, you get a slight cooling of about -0.05C/decade for Julys in the 1975-2010 period.
But as the map shows, that's quite unrealistic as an estimate of the trend for the country as a whole! The large number of tightly-clustered stations in certain areas outweighs the smaller number of stations that cover much larger areas elsewhere.
To estimate the spatially structured temperature trend I used a fairly simple kriging method. This models a continuous surface based on the irregularly distributed station data. There are many other approaches that could be used (e.g., gridding, other interpolation methods, etc). Anyway, the spatially weighted trend across all of Canada is warming of +0.18C/decade.
So ... a naive nonspatial analysis of these data give an erroneous "cooling" of -0.05C/decade. A spatially weighted analysis gives a warming of +0.18C/decade.
This is why I keep telling Berényi Péter that his repeated attempts to analyze temperature data using simple, nonspatial averages are more or less worthless.
Again, this is based on a small fraction of the overall data set, and a not necessarily optimal methodology. But it's sufficient to show that using real-world data you can end up with seriously misleading results if you don't consider the spatial distribution of your data.
some of the links seems to be broken?
Yes, two of them, sorry.
Anyway, does your first chart represent published results or just your own analysis?
As I have said, it is my own analysis. But it is a pretty straightforward one using only public datasets. Really nothing fancy, anyone can repeat it.
BTW, the result, as you can see, is published (here :)
It is not peer reviewed of course. But since the quality of the peer review process itself is questioned in this field, it is a strength, not a deficiency. Any review is welcome.
have a look at Bekryaev 2010 which uses data from 441 high latitude and Arctic surface stations
You still don't get it. The Bekryaev paper is useless in this context, as it is neither freely available nor has its supporting dataset published. Therefore it is impossible to repeat their analysis or check the quality of their data here and now. Credibility issues can get burdensome indeed.
Obviously, stations in northern Canada are mostly warming faster than those further south. So, if you did use a non-spatial averaging method, dropping high-latitude stations would create an artificial cooling trend, not warming. Using gridding or another spatial method, the decline in station numbers is pretty much irrelevant (though more stations is of course preferable to fewer).
Thanks for fixing the links, though I think Ned has actually answered one question I had quite efficiently.
I'm not sure what it is I still don't get? (why so defensive?) Bekryaev lists all sources (some of them available for the first time), the majority with links, though I admit I haven't followed them all through. I am surprised you make comments without even looking at the paper. Anyway, I genuinely thought you might be interested.
Bekryaev lists all sources (some of them available for the first time), the majority with links, though I admit I haven't followed them all through.
Show us the links, please.
I am surprised you make comments without even looking at the paper. Anyway, I genuinely thought you might be interested.
I am. However, I would prefer not to pay $60 just to have a peek what they've done. I am used to the free software development cycle where everything happens in plain public view.
#104 Ned at 07:11 AM on 11 August, 2010
Obviously, stations in northern Canada are mostly warming faster than those further south
I see that. However, that does not explain the fact the bulk of divergence between the three datasets occurred in just a few years around 1997 while the sharp drop in Canadian GHCN station number happened in July, 1990.
Anyway, I have all the station coordinates as well, so a regional analysis (with clusters of stations less than 1200 km apart) can be done as well. But I am afraid we have to wait for that as I have some deadlines, then holidays as well.
I thought it would be worth putting up a quick example to illustrate the necessity of using some kind of spatial weighting when analyzing spatially heterogeneous temperature data
OK, you have convinced me. This time I have chosen just the Canadian stations north of the Arctic Circle from both GHCN and the Environment Canada dataset.
The divergence is still huge. Environment Canada shows no trend whatsoever during this 70 year period, just a cooling event centered at the early 1970s, while GHCN raw dataset is getting gradually warmer than that, by more than 0.5°C at the end, creating a trend this way.
No amount of gridding can explain this fact away.
Adjustment history is particularly interesting. It introduces an additional +0.15°C/decade trend after 1964, none before.
Your approach still gives the appearance of cherry picking stations. As I said previously, you need to make a random sample of stations to examine. Individual stations on a global grid are not informative, except as curiosities :)
Your approach still gives the appearance of cherry picking stations
You are kidding. I have cherry picked all Canadian stations north of the Arctic Circle that are reporting, that's what you mean? Should I include stations with no data or what? How would you take a random sample of the seven (7) stations in that region still reporting to GHCN every now and then?
71081 HALL BEACH,N. 68.78 -81.25
71090 CLYDE,N.W.T. 70.48 -68.52
71917 EUREKA,N.W.T. 79.98 -85.93
71924 RESOLUTE,N.W. 74.72 -94.98
71925 CAMBRIDGE BAY 69.10 -105.12
71938 COPPERMINE,N. 67.82 -115.13
71957 INUVIK,N.W.T. 68.30 -133.48
BTW, here is the easy way to cherry pick the Canadian Arctic.
Hint: follow the red patch.
If DMI (Danish Meteorological Institute) Centre for Ocean and Ice is visited, a very cool melt season can be noticed this year north of the 80° parallel (compared to the 1958-2002 average). It went below freezing two weeks ago (with the sun up in the sky 7×24 hours a week) and stayed there consistently. This is unheard of since measurements started.
Melt season is defined here as the period when 1958-2002 average is above freezing. It is 65 days, from 13 June to 16 August.
One wonders how exceptional this weather might be.
Therefore I have recovered average melt season temperatures for the high Arctic from the DMI graphs for the last 53 years. This is what it looks like:
It is pretty stable up to about 1992. Then, after a brief warming (a tipping point?) it dives into a rather scary, accelerating downward trend. So no, this year is not exceptional, just an extension of the last two decades.
It may even be consistent with recent ice loss of the Arctic Basin, because lower temperatures mean higher pressure, a predominantly divergent surface wind pattern around the Pole, hence increased export of ice to warmer periphery. Of course with further cooling this trend is expected to turn eventually.
However, there is one thing this downward trend is surely inconsistent with. It is the upward trend reported by e.g. GISS (US National Aeronautics and Space Administration - Goddard Institute for Space Studies) and the computational climate models it is calibrated to, of course.
This conflict should be resolved.
Pick a station in this high arctic set. Dig out the data needed for homogenization, follow the GHCN manual and show us where they went wrong. Just one station.
Nah, that would be cherry picking and excessive detail.
And you guess on probability that the consilience is wrong?
BP- and I will ask again. What do you think the probability of surface temp record, glacial ice volume, sealevel and satellite temperatures trends ALL being wrong so as to give us a false trend?
I can't assign a probability to that event, because the sample space is undefined. We have no idea what might or might not going on in the background.
But I would say it's likely in the ordinary sense of the word. In all these cases people are desperately looking for tiny little effects hidden in huge noise with predetermined expectation. Not the best precondition for objectivity.
At least the surface temperature record has serious problems with neglecting the temporal UHI effect due to fractal-like population distribution and quadrupling of global population density in slightly more than a century. If you subtract this from the trend, not much remains, leaving all the multiple independent lines of evidence inconsistent with each other.
never found out what the homogenisation procedure was
Listen, I am talking about adjustments done to raw data here. I thought homogenization is supposed to come later. Anyway, it is next to impossible to assess the validity of a procedure if truly raw data are not published.
How likely is it that Environment Canada stations needed an increasing upward adjustment starting in 1964 up to 0.9°C toward the end to make their way into GHCN raw dataset?
I don't think that's a reasonable suggestion.
Spencer & Christy are "skeptics" but their UAH satellite record is not dramatically different from RSS's version (+0.14C/decade vs. +0.16). Several of the recent "blog-based" replications of the GISTEMP/HADCRUT surface temperature record were done by "skeptics" or "semi-skeptics" ... but they don't show any difference from the mainstream versions.
If Greenland were gaining ice, or if the global mean temperature were falling over the 1979-2010 period, or if there were a reasonable way to process satellite altimetry data that showed sea levels declining ... somebody would have published it by now.
Do you seriously think Spencer & Christy haven't scrutinized their methods, looking for anything that could get them back to the (erroneous) cooling trend they got so much fame and attention for in the 1990s?
Sorry, BP, but that argument just won't fly.
As to GHCN. Do think it reasonable that stations going into the GHCN have temperatures corrected so that every station measures temperature on the same basis? THEN you worry about gridding etc. I think you should actually get the station data and the GHCN adjustment data from the station custodian. Why guess?
Do think it reasonable that stations going into the GHCN have temperatures corrected so that every station measures temperature on the same basis?
Definitely. That is, it would be reasonable, but unfortunately it is not what happens.
In reality data from GHCN stations inside the US of A go into the raw data file pretty much unchanged, then later on multiple adjustments are applied to them as they make their way to v2.mean_adj. The bulk of the 20th century warming trend for the US is introduced this way.
For the rest of the world an entirely different procedure is followed, where adjustments are hidden from the public eye. That is, for these stations the additional upward trend introduced during the transition from v2.mean to v2.mean_adj is next to negligible, but there are huge adjustments to data before they have a chance to get into the raw dataset.
Of course it is always possible to re-collect data from the original sources and make a comparison (that's what I was trying to do with Environment Canada and Weather Underground), but it is not a cost effective way to do the checking, that much you have to admit.
Worse, for most of the stations in GHCN there is no genuine raw data online (not to mention metadata) from the original source, so one would need a pretty extensive organization to do an exhaustive validation job of GHCN data integration procedures.
Take Wellington. Original station close to sea level. Then it was moved to met office on top of nearby hill. ("Proof of global cooling. Adjustments arent required"). Later it was moved to airport at sealevel. ("Conspiracy to create warming by moving station. Must make adjustment"). NONE of this history is apparent in the raw data. In fact none of it accessible via internet. Since you are so sure that a station has be incorrectly adjusted, then surely the way to prove this is get the adjustment procedure from custodian and check it against the GHCN manual. None of your graphs mean anything until basis for adjustment has been audited for individual station. You can claim a coup if you find just ONE piece of fraud, so surely worth effort of writing directly to custodian and a lot more cost effective than analysis that shows that adjustments are made - we know that. Papers written on what, how, and how effective these are.
no one doubts for a moment that data in the series has to be adjusted
Agreed. However, everyone with a basic training in science and a bit of common sense would doubt the right time for adjustments is before data are put into the raw dataset.
If it is done to numerous Canadian sites we can check by Environment Canada, there is no reason to assume it is not a general practice, also done to most stations there is no easy way to recover genuine raw data for.
The straight, simple and honest path would be not to do it ever, not in a single case. Include all the necessary metadata there along with truly raw measurements and do adjustments later, putting adjusted values into a separate file.
From the Tech Terms Dictionary:
Raw data
Raw data is unprocessed computer data. This information may be stored in a file, or may just be a collection of numbers and characters stored on somewhere in the computer's hard disk. For example, information entered into a database is often called raw data. The data can either be entered by a user or generated by the computer itself. Because it has not been processed by the computer in any way, it is considered to be "raw data." To continue the culinary analogy, data that has been processed by the computer is sometimes referred to as "cooked data."
Therefore it is a valid statement that the majority of data in GHCN are cooked.
If Environment Canada (are they the real custodian or the collection agency) says this the data as read from thermometer, then it raw. You have to have the metadata about the thermometer and station changes before you can do the adjustment procedures though. This is the what is missing from your analysis. I am pretty sure that GHCN "raw" data is the station-adjusted data ready for gridding. GHCN does not have the data for station series adjustment as fas as I know. This is done by custodial agency in NZ and I guess the rest of the world. It needs local knowledge.
I note that GHCN rejects station data for which the raw data for homogenization correction is not available, so in principle, you should be able to find all that. Since you think the adjustments must be wrong, then pick the station with highest adjustment and get the homogenization data for that. Repeat the procedure in Petersen et al
A STATISTICAL ANALYSIS OF MULTIPLE TEMPERATURE PROXIES: ARE RECONSTRUCTIONS OF SURFACE TEMPERATURES OVER THE LAST 1000 YEARS RELIABLE? McShane and Wyner.
Submitted to the Annals of Applied Statistics
One of the conclusions:
...we conclude unequivocally that the evidence for a ”long-handled” hockey stick (where the shaft of the hockey stick extends to the year 1000 AD) is lacking in the data.
In other words, there might have been other sharp run-ups in temperature, but the proxies can't show them. The hockey stick handle may be crooked, but the proxies can't show it one way or the other.
Is the hockey stick broken?
http://theinconvenientskeptic.com/2010/10/what-global-te…rement-is-best/
John Kehr
The Inconvenient Skeptic
Working link.
I did indicate that the satellite measurement is a measurement of wavelength. I am not saying that it is perfect method, but none of them are perfect. Hadley and CRU also give different results. This is the one place where anomaly is beneficial.
I think it is a more useful method than all skeptics using satellite only and the AGW crowd using CRU only. Instead of arguing about interpolation methods and UHI I am using more sources of anomaly data.
If you have a better proposal for incorporating satellite data into a standard record I am all ears. I don't particularly care what method is used, but a single set that attempts to use the station and satellite data would be helpful for all.
Also, the idea that "skeptics" use satellite and AGW use surface is bogus. It is use for what purpose.
As scaddenp rightly points out, you are in error. The atmosphere is layered, like an onion. The different dataset sources measure different things. Attempting to homogenize them into a "blended" dataset is less like comparing apples to oranges than it is comparing apples and breadfruit.
Attempting to shift the focus of the debate to "skeptics using satellite only and the AGW crowd using CRU only" is also misleading. Scientist use the theory that best explains the preponderance of the data. Multiple, independent lines of evidence (of which station data and satellite data are but two) show that our world is warming and that we are causing it. That is what science is telling us.
Most "skeptics" choose to focus on part of the evidence available rather than all of it.
I can appreciate wanting to roll all of the instrumental data (station and satellite) into one neat package, but it isn't necessary. It's rather like combining the four Gospels into one continuous narrative: while interesting, it doesn't tell us anything we don't already know.
The Yooper
I don't know where you get that impression. I tend to use the satellite record if I'm making a point about recent years, and the instrumental record if the topic is longer term.
You can find nice examples of comments where I used the satellite temperature record here and here.
Note also that if you click on the "Advanced" tab at the top of this page, then scan down to figure 7, you'll see a comparison of temperature reconstructions in which I averaged the various instrumental records to get a "surface" record, and averaged the satellite records to get a "lower troposphere" record.
And since this Global Warming has only this physical evidence (witll all else being ambiguous), then their argument FAILS.
To include the quote from that page:
“A single fact will often spoil a most interesting argument.” –William Feather
Are you reading this website (the point of a discussion is to, in fact, discuss), or just posting and walking away?
Kirk, if you want physical evidence for global warming, go here and also here and here. You might also truck on over to here if you want to get a grip on the physics of GW.
I've been hearing a lot about degraded NOAA satellites. Most of what I find on it is from
viciously slanted blogs
This MSU webpage was pointed out to me. It confirms some degree of difficulties with one or more NOAA satellites that resulted in some distorted thermal images.
I'm having trouble finding information on the temporal duration of the issues. I also don't know what data has been affected.
Does anyone have an answer to this challenge?
Thanks.
Hey I was at the University of Michigan Biological Station this June and July, and I toured the Upper Peninsula a bit.
Do you do research in Michigan?
Sorry, I no longer work in the Earth Sciences fields. In pharmaceuticals now, living where I want to live instead of doing the work I wanted to do & hating where I was living (Washington, DC).
If you want to chat via email, send it to John Cook here at Skeptical Science & he'll forward it to me.
The Yooper
Which brings up my question on the temp data sets, the HadCRU and GISS ones are the same thermometers, with different data adjusting procedures etc..., while the GSOD database has many more stations - my question is does it also include the GHCN stations (while adding many more), or is it a set of completely distinct stations? I couldn't find for sure from the links at Ned #90...Also, are there any other worldwide surface station data sets distinct from the GHCN that have been looked at?
Many thanks!!
You might like to look at Ned's post above #102 to help you guess whether the hadcrut method (use global average to interpolate) or GISS method (infer for local station analysis) might give best answer for Arctic.
If someone was averaging temperatures in the way you seem to think they are, then you would have a point. However, if you see Hansen 2008, the keepers of temperature record would agree and so that is NOT how it is done.
It has been pointed out to you before with the links to the actual method, so why are you persisting with this erroneous strawman?
"The Berkeley Earth Surface Temperature project is incorporating criticism of data collection sites"
And here is what the BEST project says as of today:
We are first analyzing a small subset of data (2%) to check our programs and statistical methods and make sure that they are functioning effectively. We are correcting our programs and methods while still “blind” to the results so that there is less chance of inadvertently introducing a bias.
The Berkeley Earth team feels very strongly that no conclusions can yet be drawn from this preliminary analysis. -- emphasis added
Best to wait until there's a finding before rushing to judgment. But then, you're reading Watt$.
http://data.giss.nasa.gov/gistemp/graphs/
The Annual Mean Temperature Change in the United States appears to have peaked and is dropping at the end of the graph?
http://data.giss.nasa.gov/gistemp/graphs/Fig.D.gif
Ask you self this. If you had seen that graph in the early 1990s - obviously only with the numbers up to then - would you not say the same thing? Would you have been right? Given which, do you think looking at the ups and downs over a time range of a couple of years is reliable?
In English, this translates to the info shown for the last 5 years on the graph is less certain and more variable (relative to that which preceded it).
The Yooper
I did find and read some nice Wikipedia entries on Climate records, controversies about same, and more about the Berkeley Earth Surface Temperature Project
http://en.wikipedia.org/wiki/Temperature_record
http://en.wikipedia.org/wiki/Temperature_record_of_the_past_1000_years
http://en.wikipedia.org/wiki/Instrumental_temperature_record
http://en.wikipedia.org/wiki/Berkeley_Earth_Surface_Temperature
Also found access to "Uncertainty estimates in regional and global observed
temperature changes: a new dataset from 1850"
http://www.metoffice.gov.uk/hadobs/hadcrut3/HadCRUT3_accepted.pdf
I would appreciate pointers to raw data that we can download ourselves.
Chris Shaker