The Mudcat Café TM
Thread #59418   Message #3604712
Posted By: Stilly River Sage
25-Feb-14 - 10:09 AM
Thread Name: BS: The Mother of all BS threads
Subject: RE: BS: The Mother of all BS threads
No surprise, I suppose, about this: http://www.nature.com/news/publishers-withdraw-more-than-120-gibberish-papers-1.14763

Publishers withdraw more than 120 gibberish papers



Conference proceedings removed from subscription databases after scientist reveals that they were computer-generated.

The publishers Springer and IEEE are removing more than 120 papers from their subscription services after a French researcher discovered that the works were computer-generated nonsense.

Over the past two years, computer scientist Cyril Labbé of Joseph Fourier University in Grenoble, France, has catalogued computer-generated papers that made it into more than 30 published conference proceedings between 2008 and 2013. Sixteen appeared in publications by Springer, which is headquartered in Heidelberg, Germany, and more than 100 were published by the Institute of Electrical and Electronic Engineers (IEEE), based in New York. Both publishers, which were privately informed by Labbé, say that they are now removing the papers.

Among the works were, for example, a paper published as a proceeding from the 2013 International Conference on Quality, Reliability, Risk, Maintenance, and Safety Engineering, held in Chengdu, China. (The conference website says that all manuscripts are "reviewed for merits and contents".) The authors of the paper, entitled 'TIC: a methodology for the construction of e-commerce', write in the abstract that they "concentrate our efforts on disproving that spreadsheets can be made knowledge-based, empathic, and compact". (Nature News has attempted to contact the conference organizers and named authors of the paper but received no reply; however at least some of the names belong to real people. The IEEE has now removed the paper).

How to create a nonsense paper

Labbé developed a way to automatically detect manuscripts composed by a piece of software called SCIgen, which randomly combines strings of words to produce fake computer-science papers. SCIgen was invented in 2005 by researchers at the Massachusetts Institute of Technology (MIT) in Cambridge to prove that conferences would accept meaningless papers — and, as they put it, "to maximize amusement" (see 'Computer conference welcomes gobbledegook paper'). A related program generates random physics manuscript titles on the satirical website arXiv vs. snarXiv. SCIgen is free to download and use, and it is unclear how many people have done so, or for what purposes. SCIgen's output has occasionally popped up at conferences, when researchers have submitted nonsense papers and then revealed the trick.

Labbé does not know why the papers were submitted — or even if the authors were aware of them. Most of the conferences took place in China, and most of the fake papers have authors with Chinese affiliations. Labbé has emailed editors and authors named in many of the papers and related conferences but received scant replies; one editor said that he did not work as a program chair at a particular conference, even though he was named as doing so, and another author claimed his paper was submitted on purpose to test out a conference, but did not respond on follow-up. Nature has not heard anything from a few enquiries.

"I wasn't aware of the scale of the problem, but I knew it definitely happens. We do get occasional e-mails from good citizens letting us know where SCIgen papers show up," says Jeremy Stribling, who co-wrote SCIgen when he was at MIT and now works at VMware, a software company in Palo Alto, California.

"The papers are quite easy to spot," says Labbé, who has built a website where users can test whether papers have been created using SCIgen. His detection technique, described in a study1 published in Scientometrics in 2012, involves searching for characteristic vocabulary generated by SCIgen. Shortly before that paper was published, Labbé informed the IEEE of 85 fake papers he had found. Monika Stickel, director of corporate communications at IEEE, says that the publisher "took immediate action to remove the papers" and "refined our processes to prevent papers not meeting our standards from being published in the future". In December 2013, Labbé informed the IEEE of another batch of apparent SCIgen articles he had found. Last week, those were also taken down, but the web pages for the removed articles give no explanation for their absence. . .

Read the rest at the site. And don't get carried away, I know some of you already know how to do this. :)