Yes, the data collection push is out of control.
Data collection issues and privacy rights were the last thing on my mind, until last April, when I learned what Common Core was (besides educational standards that are communizing America’s education). When I learned that common core tests gather kids’ data that is nonacademic, personally identifiable, and longitudinal –meaning it goes from preschool through adulthood and is tracked by the government and researchers who will not need permission to study it– I was horrified. But the data collection desperation of agencies worldwide, continues. For example:
- Just this morning I got an email from a company that contracts with a company I work with to translate foreign documents. They wanted to purchase –in any language– full blogs, full email accounts, and other writings, for a secret client that they said needs a lot of data to practice a new spellchecker. Nuts! (I’ll post the full “job” email* at the bottom.)
- This week, I learned about a German man, Malte Spitz, now an international data privacy freedom fighter. Here’s part of his story (for full text: http://www.huffingtonpost.com/2012/07/25/malte-spitzs-ted-talk-_n_1701775.html )
In 2006, the EU issued the Data Retention Directive, Directive 2006/24/EC. This allowed European phone companies to store user data for six months to two years — including phone numbers, addresses, the times emails and data were sent, as well as users’ locations. Since then, several countries have either rejected or declared unconstitutional this legislation. In 2010, Germany’s Federal Constitution Court suspended the directive, calling it “inadmissable.”
The directive does state that the content of users’ text and voice conversations are not to be stored.
Police agencies could request information from mobile phone companies to access user data, but only via the court system. Spitz filed a suit against his phone company Deutsche Telekom in order to receive his own stored data.
After reaching a settlement, Spitz received a CD of his records in the mail. “At first I thought, okay — it’s a huge file,” he said, “But then I realized, this is my life. This is six months of my life […] You can see where I am, when I sleep at night, what I’m doing.”
- Then there’s Joanne Weiss, Chief of Staff of the U.S. Dept. of Education, who openly admits to “data-mashing,” meaning blending the databases from different federal agencies. She also has said she wants to be helpful to states who want to “partner” and share data.
- Then there’s John Brandt, our Utah Technology Director, CCSSO chair, and NCES member (translation: he’s a fed). He openly admist on his powerpoint online, that the Dept. of Education can be one of the recipients of Utah’s inter-agency data mashing.
- Then there’s “Communities that Care,” a nice-sounding euphemism for a federal lure to give up local data via a program that on the surface, is all about preventing teen drug use and crime. But it’s also a way for the federal government to access what we are thinking, both via ongoing youth surveys, and via archived family and individual data kept by the city.
- My own doctor said that he was offered thousands to share data with the government about his patients. He opted not to accept the money because he believes in patient privacy.
Why are governments so desperate to gather so much private data on citizens? So desperate that they’re overriding Congressional FERPA laws, so desperate that they’re cutting out parental consent.
To read more about this topic:
Department of Education Being Sued for Invasion of Privacy: http://epic.org/apa/ferpa/default.html
Oregon Senator’s Website: http://www.merkley.senate.gov/newsroom/press/release/?id=457f640a-2995-49c4-b386-27ca44c639a8
Federal Surveillance of data via Common Core tests: http://www2.ed.gov/programs/racetothetop-assessment/sbac-cooperative-agreement.pdf
TrapWire Surveillance: http://thenewamerican.com/usnews/constitution/item/12473-trapwire-the-federal-govt-is-literally-watching-every-move-you-make http://thenewamerican.com/tech/item/12635-trapwires-alleged-corporate-and-government-connections-grow
Many thanks for your interest in our program and for providing your experience in translation. Unfortunately we are not looking for a translation service at present; however, as mentioned in our advert we are collecting many versions of data on behalf of a client of ours. This data will be used to assist them in the development of their language tools. If this is something which you think you can assist us in, then please review the details below.
Below you will find some frequently asked questions which will provide you with more data on the program. Please read carefully to check if your language is available.
Note: We are only accepting languages which are available on the list at present.
We aim to collect a large amount of data for each language, so we hope we can collect a minimum of 150,000 words from each person participating. If you think you can reach this number, please let us know. If not, then please continue to save your data and contact us again in the near future.
Unfortunately everyone who contacts us may not be able to join this program, however, if you do know of someone that has their language included, please pass our information to them. We encourage all people to review their language / data.
On reading the FAQ, please reply and let us know what type of data / language you can provide to our program. We can then work on the collection process.
Please note, we do allow participants to donate more than one language if available.
We look forward to working with you.
Kind Regards, Lionbridge Data Collection Group
1) What languages are available? In our program we are now looking for the following languages: English UK, English US, Basque, Bulgarian, Croatian, Estonian, Finnish, Galician, Hungarian, Kazakh, Lithuanian, Romanian, Serbian (Latin and Cyrillic), Slovak, Slovenian, Turkish, Ukrainian, Arabic (Standard), Brazilian Portuguese, Chinese (Simplified and Traditional), Czech, Dutch, French, German, Greek, Italian, Japanese, Latvian, Norwegian, Polish, Portuguese European, Spanish European, Swedish, Indonesian, Latin American Spanish, Danish and Thai.
2) What if my language is not on the list? We are beginning with the languages listed above. However, we may begin collecting for your language in the future. Please begin to save your emails / reports etc. Also, you may know of a friend / colleague who may be able to join now. If so, then pass on our information to them.
3) Who gets my data? We are collecting all data in conjunction with a client who requires a large amount of words to help develop their language tools e.g. spellchecker. No other party will have access to your data
4) What data can I include? a. Email – you can include personal emails which you have written in your own language b. Reports – If you are at college, you can include draft reports which you have written for college (i.e. these are the first writings of your reports, not the final delivered version to your lecturer). If you are a journalist, you can include drafts of articles you have written. Note draft articles should contain both grammar and spelling mistakes i.e. they are not proof read. c. Letters – any letters which you have written in your native language d. Blogs – If you have created a blog and write regular updates, this could be included.
5) If I send email, what happens if I include personal email? Once you send us your email, we will first change all of the email addresses and numbers to email@example.com <mailto:firstname.lastname@example.org> and 000 to remove any personal identification. Your name / signature however will remain on the email if included.
6) Can I use any email account? Yes you can use most email accounts which can be setup either on the internet or at home. Note we are having some issues with exporting from yahoo.
7) How much data to I need to send you? We are looking to collect 600,000 words from each person; however we understand that this is a lot of data for one person. Therefore to assist you we are willing to receive as low as 150,000 words: – On average 2,000 emails. – 200 pages
8) What if I do not have enough data? Don’t worry if you don’t have enough data right now. You can begin to save your data and join our program at a later date. Also, remember, if you have emails and reports, you can join both to reach the required number. We can help you with this.
9) How long do I have to collect the data? We appreciate it can take time to get this detail together and to assist you we will be providing step by step instructions. This program is running until September 30th 2012.
10) Do I get paid for my data? Yes you do! For every 100,000 words you send to us, we will pay you $110.
11) How do I know my data is secure? On acceptance of your data, you will sign a data release form to say that our client can now use your data. No other party will have access to your data.