Welcome to WebmasterWorld Guest from 188.8.131.52
Forum Moderators: not2easy
I'm not talking about newspaper articles and their content. Just a bunch of numbers that get reported weekly. I'm planning on collecting them and publishing some reports based on the collected data.
Quoting the source is not mandatory but very much recommended.
On one hand, doing so invariably adds weight to the information you're presenting, provides provenance, and, if used properly, makes people think you really know what you're talking about.
On the other hand, citing sources lets you off the hook should anyone claim the data is actually a load of drivel!
When statistical data is published in the newspapers, are they also covered by copyright or do they become public domain information?
What type of data? Is the newspaper just repeating data they require from a public domain source or is the newspaper developing the data?
If this is just public domain information, it would be better to go directly to the source and not copy the newspaper.
Would they somehow be able to determine that I am not legally allowed to use the data that they have put some effort into collecting?
The express of the facts are covered by copyright. However the raw data of the facts is not..
However beware, the AP has been trying to use "Hot News" mis appropriation. (many say against fair-use or law) to claim ownership of the facts.
Standford Univ. has a great copyright site. Also a google search for AP "Hot News" may prove for some interesting conversation.
...is the newspaper developing the data?
It does not matter who develops the data, once it is released it is public information. Yes, it would be nice to go to source to ensure that all the data sets are available and to ensure accuracy of reporting, but that does not affect anything in regards to copyright: the information is still in the public domain.
Again, if concerned - although there's no reason to be - just ensure that you cite the source.
Three quarters of people surveyed believe that 75% of statistics are useless. Source: Syzygy.
[edited by: Syzygy at 4:44 am (utc) on June 24, 2008]
Well, the data is definitely based on facts (and you're right, how can you copyright facts?)
"Based on facts" can be very different from "facts." There is factual information that not only can be protected by copyright but also constitutes protected trade secret.
...and it would be easier for me to just use what they have published.
Not to be too dramatic, but instead of working, it would be easier for me to just steal money from those who do work. That doesn't make it ethical or legal.
Would they somehow be able to determine that I am not legally allowed to use the data that they have put some effort into collecting?
Which are you more concerned about, getting caught or doing the right thing?
I had a client who developed and sold lists of certain information based on "facts" obtained from required government filings. Although the basic facts couldn't be copyrighted, their presentation of the facts certainly could and was. They regularly "seeded" their presentation and it made it very easy to see who was copying their work.
Have no fears. There are no strings attached to the publishing of facts.
Yes there are strings attached.
The whole point of this type of data is not just that it is in the public domain, but that it's been released by the collators specifically for the public domain.
Again, I disagree.
On top of that, all compilers of surveys and facts love to see their work quoted - that's the whole point of it.
Once again, I disagree with this as a blanket statement.
This is one of the oldest rationalizations in the book and probably the one that gets people into trouble the most often.
It does not matter who develops the data, once it is released it is public information.
Farmboy, I see your point, but money isn't in the public domain - facts are. You say that you disagree, but you don't explain why. How can you copyright a fact? Sure, you could publish facts and copyright the publication, but the facts themselves still remain in the public domain.
I would cite the source - that gives the facts an air of authority.
...but you don't explain why.
Those who want to do things honestly will turn to authorative sources rather a discussion on a message board.
For those who want to be told it's OK to just take the easy road, trying to explain to them is an exercise in futility.
The OP seemed to be seeking opinions/feedback instead of affirmation. Various opinion and feedback has been offered. It's up to the OP to decide what do do now - that's the most anyone should expect from a message forum.
The OP hasn't been very forthcoming. All we know is that it is some sort of statistical data that, in some cases people are hired to collect, in other cases it is paid for.
Best advice,jmorgan, is to pay for an hour of an attorney's time.
When statistical data is published in the newspapers...
There is no moral argument over what is right and wrong here, this is not about honesty: published facts are in the public domain.
How and why do you think it's in the newspaper in the first place? Because both the organisation(s) compiling the data and those printing it want the facts made public!
75% of all statistics are made up on the spot.
Glad someone got my joke :-)
That "based on" can run a whole gamut of things and unless you or we know what it is we can't make an even uneducated guess as to what's what.
Copyright protects the particular way an author has expressed himself; it does not extend to any ideas, systems, or factual information conveyed in the work.
The safest course is always to get permission from the copyright owner before using copyrighted material. The Copyright Office cannot give this permission.
Of course, the key issue is at what point it is a fact and when it becomes presentation.
all compilers of surveys and facts love to see their work quoted - that's the whole point of it.
published facts are in the public domain.
The whole point of this type of data is not just that it is in the public domain, but that it's been released by the collators specifically for the public domain
again, this rests on a misunderstanding of what is meant by the term "public domain." Surveyers and reporters of statistical facts are usually not as altruistic as you seem to think.
A newspaper runs a snippet of statistical data: "5% of all cats hate their owners". A magazine reports on a survey of growth in fmcg and quotes that "the rise in the consumption of chocolate bars continues at £5 million per household per quarter".
Another source - a magazine, another newspaper, a TV show - wants to use that information. Where are the copyright issues exactly?
The media has been doing it since statistical data was first bandied about in its pages. Publications love to publish 'factoids' - little snippets of information, usually statistically based - to add interest to features and articles.
I've done it for years in magazines. Also, actually I've seen statistical data that I've produced being freely reproduced from one media channel to the next, so I've seen at first hand how statistical information can spread. It's the way things are.
Some research companies make fortunes doing exactly as jmorgan described in his original post and in-house marketing depts, analysts, reporters, authors and a plethora of others do it!
There are no issues in the context described!
I've never seen something so simple become so clouded and confused!
just to show that intuitions might not be right...
United States government statistics are Public Domain, while Australian government statistics are Copyright.
UK Government statistics may be used for private study or research purposes without prior approval. Using those for any other purpose requires a license.
Further, stats on the UK site produced by third parties can't be copied without authorization of the copyright holders. [statistics.gov.uk...]
Of course, those who want to take the easy road and seek to offer rationalization will copy those statistics at will, put them on a site to make money and call it "research."
And they will call every bit of compiled data, statistics, etc "facts" in an effort to excuse their activities.
This is why I mentioned above that conversing with these people is basically an exercise in futility.
Thieves are often talented at making excuses.
Here's a lengthy treatment of copyright protection for databases, facts, statistics, etc. from the US Copyright Office's site - [copyright.gov...]
The issue is not about whether one can freely take protected statistical data from sources like the UK Statistics Office without their permission and reproduce it copyright free. Nor are we discussing the relevancy of copyright as it pertains to databases/compiled data, etc.
What we're looking at is information that has been freely and readily introduced to the public through being published in the media.
Here are the search results for news stories containing the word 'survey' on Google News [news.google.co.uk] in the past month.
I'm seeing 133,695 stories that contain something about a survey. Not all will be about the type of survey we're looking at here, but rest assured many will.
By making such data available to the media, those who distribute it - in the form of press releases, reports, briefings, etc, - are allowing those whom they supply the information to, and the rest of the world, including their dog, the right to quote or use that information.
Here are some facts that I've just taken from General Motors:
General Motors Corp. (NYSE: GM), the world’s largest automaker, has been the global industry sales leader for 76 years. Founded in 1908, GM today employs about 280,000 people around the world. With global headquarters in Detroit, GM manufactures its cars and trucks in 33 countries. In 2006, nearly 9.1 million GM cars and trucks were sold...
I've not infringed anyone's rights by reproducing the data here.
GM might be concerned that I've reproduced their info word for word - copied it - but beyond that there are no issues whatsoever.
Here are some more statistical facts, this time taken from a current BBC story on US interest rates [news.bbc.co.uk]:
In the housing market, property prices fell by their fastest rate since 2000, according to the Case-Schiller home price index released on Tuesday.
Prices in the 20 cities it monitors were 15.3% lower in April compared to the year before.
The narrower 10-city index was 16.3% down, its biggest decline in its more than two-decade history.
If I use these publicly released figures on my site or in a magazine, or indeed anywhere, I am not infringing anyone's copyrights. Or is it a case that in fact it's the BBC who is doing the infringing as they're using someone else's data..? Come on now!
Information of the kind we are discussing - statistical data - released to and published by the media is public information.
Stuff like statistical data spreads across the internet like a bad cold, especially when it's good: one site publishes it, another pics up the story and runs with it - and uses the same, or even more data. Other sites take an interest and soon your figures are everywhere, if you're lucky.
If it's in the news and its a fact then that fact is in the public realm. And we haven't even mentioned the US principle of Fair Use yet...
[edited by: Syzygy at 6:13 pm (utc) on June 25, 2008]
It's a fair use. Law changes in different countries.
If you're going to run a big statistic website aimed at collecting non-US gov't statistical report, i'm sure there'll be potential troubles with it.
I'm planning on collecting them and publishing some reports based on the collected data.
But if that's true, then so is this:
Nor are we discussing the relevancy of copyright as it pertains to databases/compiled data, etc.
...which would be the problem with this:
The Economist Intelligence Unit sends me a report every week. That's statistic, but they clearly note that I can reproduce a part of their list with their credit quoted, but never the whole list.
The list is their compilation and expression of the data, and that's what's copyrighted - not the data themselves. Reproducing the list is a completely different issue than using some of their statistics in your own report.
The link to the U.S. Govt. copyright site given by farmboy is a good one - but it's specifically about compilations, and repeats the basic statement that facts and ideas cannot be copyrighted. Some people misuse the term "research," but that doesn't mean true research never occurs.
I feel obliged to add that, after reading Syzygy's posts regarding copyright and copywriting for 3+ years, I find it odd to see him lumped together with people looking for "the easy road."
[edited by: Beagle at 6:28 pm (utc) on July 7, 2008]
An example of this is a list of telephone numbers, such as a directory, and what sort of creativity might have gone into that list.
An alphabetical list of phone numbers by the last name of the person is not protected according to the courts, because that is pretty much a standard way to list things in the english speaking world.
A listing by category, when you create the categories, can be protected in its expression. You can feel free to extract and use the "facts" from that directory, but it would but playing with fire to copy the directory structure. You are probably safe to copy the white pages, but they yellow pages would be risky.
I spend a lot of time and money tracking the sales of domains, websites and online businesses - the metrics that influence buying prices etc. I use that data to create formulae to predict the prices of sites that haven't yet come up for sale.
I wouldn't like anyone to completely reproduce the data I send out to a few "friends" about the stats collected, or the price predictions made using my data and my Intellectual Property (formulae). Quote one or two figures, yes, but with full disclosure about the source and perhaps an invitation for the reader to visit my site.