Most posts on this website are about duplications within or between figures in the same paper, or about duplications found between papers by the same group of authors. But now, our small group of image forensics detectives has come across a large set of papers – over 400 as of today – from different authors and affiliations that all appear to have been generated by the same source. Based on the resemblance of the Western blot bands to tadpoles (the larval stage of an amphibian, such as a frog or a toad), we will call this the Tadpole Paper Mill.
Update: for those who cannot access the Google Sheets link above, here is a PDF version embedded in WordPress which I hope will work for all (generated March 3, 2020):
What is a paper mill?
A paper mill is a shady company that produces scientific papers at demand. They sell these papers to e.g. medical doctors in China who need to have a scientific paper published in an international journal in order to get their MD, but who do not have any time in their educational program to actually do research. Authorships on ready-to-submit or already-accepted papers are sold to medical students for hefty amounts, as described e.g. by Mara Hvistendahl in ‘China’s Publication Bazaar‘ in Science, 2013. Whether or not the experiments described in these papers has actually been performed is not clear. Some of these paper mills might have laboratories producing actual images or results, but such images might be sold to multiple authors to represent different experiments. Thus, the data included in these papers is often falsified or fabricated.
Jana Christopher, an Image Data Integrity Analyst, described such a paper mill in her 2018 paper ‘Systematic fabrication of scientific images revealed‘ in FEBS Letters.
In her paper, Christopher describes a set of 12 manuscripts that were submitted to different FEBS journals, which all contained Western blot with very regularly spaced and particularly shaped bands, unusual absence of dots, smears, or stains, and identical backgrounds. These manuscripts however, came from different research groups at different institutions.
Jennifer A Byrne, a professor at the University of Sydney has also raised concerns about such paper mill-produced fraudulent studies, in particular those describing gene knockdown studies. In her 2019 paper ‘The Possibility of Systematic Research Fraud Targeting Under-Studied Human Genes: Causes, Consequences, and Potential Solutions, she describes a cohort of gene knockdown papers with a very similar figure layout and order.
Now, Byrne and Christopher teamed up to publish another paper about paper mill publications. Their paper ‘Digital magic, or the dark arts of the 21st century—how can journals and peer reviewers detect manuscripts and publications from paper mills?‘ was published this week in FEBS Letters.
The Tadpole Paper Mill
As it turns out, Byrne and Christopher’s publication describes the exact same set of papers that our small team of image forensics detectives had been working on in the past month. While they show results from 17 unrelated manuscripts, our team, consisting of Smut Clyde, Morty, Tiger, and me has found about 400 papers so far. You can find the (still growing) list of papers here.
As described by Smut Clyde on Leonid Schneider’s blog For Better Science in a blog post called ‘The full-service paper mill and its Chinese customers‘ the background similarities between Western blot panels were first noticed by pseudonymous PubPeer user Indigofera Tanganyikensis in two PubPeer posts here and here.
Not only was the Western blot background highly similar between panels within the same figure, it was also unexpectedly similar between panels from different papers.
As previously described by Christopher (see above), the Western blot bands in all 400+ papers are all very regularly spaced and have a smooth appearance in the shape of a dumbbell or tadpole, without any of the usual smudges or stains. All bands are placed on similar looking backgrounds, suggesting they were copy/pasted from other sources, or computer generated.
Death Star Flow Cytometry plots
In addition to the similarities between the Western blot in all 400+ papers, there are other similarities too. Some of the papers contain unusually shaped flow cytometry plots that Smut Clyde beautifully described as follows (it refers to a Star Wars weapon).
The Death Star down in each frame’s quadrant Q3 is shooting out bolts of planet-smashing energy to the right through Quadrant Q4, while disciplined flotillas of Rebel Alliance X-Wing starfighters are swooping down through Q1 and Q2.Smut Clyde – ‘The full-service paper mill and its Chinese customers‘
The bar graphs in most of these papers also look very similar. They usually are in solid shades of grey, with the black bar on the left, and with double edged error bars. As with the titles, such layout similarities in figures are not necessarily a problem, and this might be e.g. the standard format for a popular graphing program. However, when the same layout is found in 100s of papers from different institutes, it is a red flag.
Publication title reveals paper mill suspects by using a similar structure
Most of the papers share a similar title structure. Note that this is not necessarily a problem by itself, in particular for papers written by authors for whom English is not a first language. However, in combination with the Western blot background similarities, and greyscale bargraphs, the title structure is one of the signs that a paper belongs to this Tadpole paper mill set. Examples of paper titles that follow this structure are:
- Bilobalide alleviates tumor necrosis factor‐alpha‐induced pancreatic beta‐cell MIN6 apoptosis and dysfunction through upregulation of miR‐153
- mir-488 inhibits cell growth and metastasis in renal cell carcinoma by targeting hMgn5
- Baicalein retards proliferation and collagen deposition by activating p38MAPK‐JNK via microRNA‐29
Here is an illustration of how the titles might be generated.
Affected institutes and journals
If you take a look at the list of 400+ papers you can see some patterns. Note that the publicly available list is always a bit behind on the list we are actively working on, so the number of papers might be a bit lower. We will try to update regularly.
The majority of the Tadpole Tales have been published by the following six journals with multiple publishing houses involved:
- Artificial Cells Nanomedicine and Biotechnology (Taylor & Francis): 76 papers
- Journal of Cellular Biochemistry (Wiley): 57 papers
- Biomedicine & Pharmacotherapy (Elsevier): 45 papers
- Cellular Physiology and Biochemistry (Karger): 26 papers
- Experimental and Molecular Pathology (Elsevier): 26 papers
- Journal of Cellular Physiology (Wiley): 21 papers
The earliest papers from the Tadpole paper mill are from 2016, although the bulk of the papers have appeared in 2018-2020.
Many of the papers appear to have been authors affiliated at hospitals or medical schools, which is not surprising since having at least one publication is part of the requirements to become an MD in medical schools in China.
- Jining No. 1 People’s Hospital: 101 papers
- China‐Japan Union Hospital of Jilin University: 59 papers
- The Affiliated Hospital of Qingdao University: 23 papers
- Linyi Central Hospital: 16 papers
- The First Affiliated Hospital of Zhengzhou University: 16 papers
- Affiliated Hospital of Jining Medical University: 12 papers
Although more than 100 papers were authored by doctors from Jining No. 1 People’s Hospital in the Shandong province, there is little overlap between the authors. The papers were affiliated with a range of different departments, from Pediatrics (15 papers), Cardiology (6), Endocrinology (6), Nephrology (6), and Vascular Surgery (5).
Raising the alarm
This large set of presumably falsified papers is cause for great concern. Our small team of volunteers has found 400 papers, based on similarities in titles, keywords and layout. These were relatively easy to find, although we did put in many hours of unpaid work to find these papers. One can only guess how many other falsified publications have been produced by other paper mills using less recognizable and more variable layouts. The number of such paper mill articles might be in the thousands.
It is also sad to see that all these papers have been published after peer-review with apparently little editorial quality control.
As suggested by both Christopher and Byrne in the papers mentioned above, critical screening of images and other data should be an essential part of the pre-publication process of scientific papers. Scientists pay money to either publish or read these papers (depending on whether they are paywalled or open access), and they might spend months or years trying to replicate these results. Publishers should therefore do a much better job to screen papers before they accept papers for publication, and they should not just rely on unpaid and untrained peer reviewers or post-publication volunteer work to detect fabricated images.
In addition, it is of great concern to see that this specific paper mill has successfully “infected” particular journals. An anonymous source has even suggested that some of these journals might be actively working together with such paper mills, and promote the generation of fake papers and selling of such publications to the large army of aspiring MDs desperate to finish their degrees. Although this rumor has not yet been confirmed, it is very alarming to see that journal editors do not appear to have noticed the similarities between dozens of papers published in their journals.
As a scientific community, we need to work harder in detecting such fraudulent papers. Finding these fabricated images should not rely solely on the work of unpaid volunteers.