Reporting Experiments in Homeopathic Basic Research—Description of the Checklist Development
The objective of this study was to develop a criteria catalogue serving as a guideline for authors to improve quality of reporting experiments in basic research in homeopathy. A Delphi Process was initiated including three rounds of adjusting and phrasing plus two consensus conferences. European researchers who published experimental work within the last 5 years were involved. A checklist for authors provide a catalogue with 23 criteria. The “Introduction” should focus on underlying hypotheses, the homeopathic principle investigated and state if experiments are exploratory or confirmatory. “Materials and methods” should comprise information on object of investigation, experimental setup, parameters, intervention and statistical methods. A more detailed description on the homeopathic substances, for example, manufacture, dilution method, starting point of dilution is required. A further result of the Delphi process is to raise scientists' awareness of reporting blinding, allocation, replication, quality control and system performance controls. The part “Results” should provide the exact number of treated units per setting which were included in each analysis and state missing samples and drop outs. Results presented in tables and figures are as important as appropriate measures of effect size, uncertainty and probability. “Discussion” in a report should depict more than a general interpretation of results in the context of current evidence but also limitations and an appraisal of aptitude for the chosen experimental model. Authors of homeopathic basic research publications are encouraged to apply our checklist when preparing their manuscripts. Feedback is encouraged on applicability, strength and limitations of the list to enable future revisions.
Homeopathy is a widely used and highly controversial complementary therapy. The central aspect of the dispute is the use of highly diluted, “potentized" remedies which, according to conventional scientific thinking, make any biological activity highly unlikely. But also the validity of the fundamental tenet of homeopathy—the Similia Principle (like cures like)—is not generally accepted by the scientific community. Therefore, any publication of a research study in homeopathy, particularly if reporting positive results, is subjected to special scrutiny. Clinical research is of utmost importance to investigate efficacy and effectiveness. Basic research is needed to investigate possible mechanisms of action. However, as clinical research so far has failed to prove or disprove specific effects of homeopathy in general and of high potencies in particular, laboratory research is also needed to investigate whether homeopathic preparations have biological activity.
Published experiments can be assigned to four main areas of basic research on homeopathy: animal research, plant bioassays, in vitro models and physicochemical research . The HomBrex Database lists more than 1100 experiments published in about 900 publications in different fields of basic research . To appreciate and understand the methods and findings of such studies, a high degree of transparency is required from authors. Reviews have shown that many experiments differ in design and quality . Over the years, several efforts have been made to enhance the quality of basic research in homeopathy. Schulte proposed a set of standards for ultra high dilution research , Linde et al. worked out checklists for preparing and reviewing publications  and also suggestions on methodological standards  have been published. Moreover scoring systems for assessing quality were generated: the first for the use in experimental toxicology  the second for physicochemical research into homeopathic potentization . The latter score was modified and used in a systematic review on in vitro experiments .
It seemed reasonable to try to bring together all aspects of former evaluation scores and to develop with a team of experienced researchers a basic score applicable (possibly in a slightly adjusted format) in all fields of basic research on homeopathy. During the discussion process it was decided to shift the focus from the development of a scoring system to the development of a reporting checklist for authors, peer reviewers and editors (and possibly readers and authors of systematic reviews) similar to publication guidelines in clinical research (CONSORT, REDHOT). This article describes and discusses the development process of the checklist (REHBaR—Reporting Experiments in Homeopathic Basic Research). A second publication will focus on the last step of the Delphi Process (round V), where the discussed criteria and their detailed explanations as a result of the whole process are given, supported and further evidenced from other publications.
2.1. The Delphi Process: A Consensus Method
From the existing consensus methods we chose a Delphi approach. Delphi may be characterized as a method for structuring a group communication process so that the process is effective in allowing a group of individuals, as a whole, to deal with a complex problem. In order to obtain a useful result for their objective there is a need to structure a group communication process . This Delphi technique is applicable to a variety of questions ending up with different outcomes. In our case, the goal was to create an agreed checklist with criteria important for improving the quality of reports in basic research. A search for papers in the databases PubMed and CAMbase with the following keywords was conducted until July 2009: guideline, reporting, publication, catalogue, basic, clinical, fundamental, research, author, quality, CONSORT.
2.2. Preparatory Work
Before starting the Delphi process a preliminary collection of 58 potentially relevant items was compiled as preparatory work. The collection comprised all items included in the two scores developed for systematic reviews in basic research [7, 8]. Additional items were included on the basis of informal discussion with experts and the corresponding author.
2.3. The Delphi Process (Round I–V)
In the Delphi process there were a total of five rounds, including two consensus conferences (see Figure 1). In Delphi I, the list of 58 items primed in the preparatory phase by the initiators was sent to all participants. All experts were asked to rate the importance of each item on a 5-point Likert scale, and to add further items. Delphi II—the first conference—offered the opportunity to clarify misunderstandings and misconceptions and discuss the objective of the catalogue in general. In Delphi III a modified catalogue was prepared taking into account the revised objective (see “Results"), which was again discussed in the conference Delphi IV. In Delphi V, the reporting document was prepared and iteratively revised in an internal review process.
2.4. Team of Experts
Members of the panel were S. Baumgartner, L. Betti, C. Endler and R. van Wijk as leading European researchers in homeopathic experiments with plants, animals, evolutionary biology as well as biochemical and biophysical research. K. Linde and C. Witt were involved in Delphi for two reasons: first they are well known experts in the field of clinical research concerned with quality questions, and secondly both are experienced in basic research and developed a score for quality assessment in basic research [7, 8]. R. Lüdtke was responsible for the statistical aspects, H. Albrecht, G. Dobos and F. Musial for general support in questions of basic research.
3.1. Delphi Round I: Rating and Selection of Items
In the first round, all members of the team rated the relevance of the 58 items included in the preliminary list. Out of 58 items, 18 were rated as “absolutely necessary" (mean: 3.5–4) and 31 items as “moderately necessary" (mean: 2.5–3). No item was rated “moderately unnecessary" or “strongly unnecessary", but nine items received neutral rating (mean: 2). Only one item was added and discussed in the conference.
3.2. Delphi Round II: The First Consensus Conference
In the first conference we came to the conclusion that our initial intention—creating a new evaluation score—should be postponed. A criteria catalogue for improving the quality of reporting experiments was considered a priority. The main reason for this was that accurate reporting is a precondition for a valid quality assessment. Furthermore, the importance of quality scoring, in clinical research, is currently under discussion. For example, the current version of the Handbook of the Cochrane Collaboration, a worldwide network for performing systematic reviews on clinical research, explicitly discourages the use of quality scores : it is argued that while this method of scoring is simple, it is not supported by empirical evidence. The calculation of summary scores involves assigning “weights" to different items in the scale, and it is difficult to justify the weights assigned. Furthermore, scales have been shown to be unreliable assessments of validity . Therefore it is regarded as preferable to use simple approaches for assessing validity that can be fully reported (i.e., how each trial was rated on each criterion).
3.3. Delphi Round III: Phrasing of the Checklist
3.4. Delphi Round IV: The Second Consensus Conference
Another conference was held in order to discuss the reassessed first checklist looking at the purposes of exploratory and confirmatory experiments. The final list, unanimously agreed upon, is presented in Table 1: “Items to be included when Reporting Experiments in Homeopathic Basic Research". It was then decided to refrain from establishing an evaluation score. As an additional step, the group selected those items from the first list that were considered to be the minimum essential information needed for reviewing publications in this field. This new catalogue includes all parts readers should know to clearly follow the experiments and to fully understand the results. The catalogue will be published after an internal reviewing process and test of practicability.
3.5. Delphi Round V: Rewording of the Checklist Supplemented with Explanations
In the last round detailed explanations for each item based on the first catalogue were written, reassessed by all participants and supplemented with examples (available from the corresponding author). Often it is not possible to give all information due to space limitations of the journal. Therefore we recommend authors to refer to a website address to make further information available. If certain methods or the procedure of positive and negative controls are already published, it may be sufficient to refer to that publication. As publications often report on several experiments performed with regard to one research question, the report should distinguish between the single experiments and make them obvious to the reader. Furthermore, it is important to make clear if the experiment was designed for explorative or confirmatory purpose and whether it followed a specific hypothesis or not.
Accurate reporting is a prerequisite for critical interpretation of any research study and its findings. Without accurate reporting it is not possible to assess the merits of a study. Any positive findings of basic research experiments on high dilutions or potencies are likely to stir controversy in the academic community. If details in the publication (or an additional accessible report) are insufficient to allow repetition of experiments performed or to assess potential threats to validity, the value of a potentially important experiment is strongly compromised. In an 18-month Delphi process we have tried to come up with a checklist aimed at helping authors to prepare high quality manuscripts on their homeopathic basic research experiments, and at helping peer reviewers, editors and readers to check whether reporting accurately reflects their experiments.
Shortcomings in reporting experiments are not at all unique to homeopathy but a general phenomenon. The limited space in journals, specific style and editorial pressure often force authors to shorten their paper to an extent where a detailed description becomes impossible. However, even with limited space, a good manuscript can provide a lot of relevant details, and the internet makes it possible to make more detailed reports easily available to interested researchers. In clinical and epidemiological research several checklists to improve reporting quality have been developed within the last number of years. The first attempt to improve quality of reporting randomized clinical trials was the CONSORT statement in 1996 . This document has been revised in 2001  and 2005 , and recently further documents have become available for improving the reporting of abstracts , pragmatic trials , non-pharmacological trials  (see also http://www.consort-statement.org/). The necessity of CONSORT was recently encouraged by the poor standards common in reporting RCTS in Tai Chi interventions  as well as the poor standard of reporting in Chinese journals .
QUORUM provides a guideline for reporting meta-analyses [21, 22]. An evolution of this guideline was developed by an international group: Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) . For observational studies, including cohort, case-control and cross-sectional studies STROBE emerged , recently extended with STREGA (STRengthening the Reporting of Genetic Association studies) . Comparable lists are available for trials in acupuncture: STRICTA  is already discussed  and assessed . For complete and accurate reporting of studies of diagnostic accuracy (STARD) an item list was compiled . Standards for Quality Improvement Reporting Excellence (SQUIRE) were published to enhance reporting on quality improvement studies in health care . RedHot , an unofficial extension for CONSORT, was established to assist reporting on homeopathic treatments in clinical trials. A further CONSORT statement was compiled  concerning randomized, controlled trials of herbal interventions, evaluated in a systematic review of instruments developed to critically assess the quality of trials on the efficacy of natural health products . A complete collection of available reporting guidelines can be found at EQUATOR (Enhancing the QUAlity and Transparency Of health Research), a new international initiative based on a network concept .
Basic research is different from clinical and epidemiological research, and reporting guidelines comparable to those described above for clinical research do not exist in this area outside of homeopathy. In biological sciences there are some groups working on standards of data presentation. These initiatives were derived from research groups dealing with microarray experiments and provide several checklists on how to report and deal with large numbers of data . An overview of these activities can be found at the homepage of MIBBI (Minimum Information for Biological and Biomedical Investigations) http://www.mibbi.org or the MGED (Microarray Gene Expression Data) society.
Our checklist is compiled for experiments on homeopathy and how to report each step of experiment which is important to understand and appreciate the results.
A first proposal for a reporting guideline in homeopathy was introduced in 1991 . It focused on experiments dealing with ultra low dose effects (serial dilutions and potencies).
The group refrained from establishing a score for assessing quality, as had been common in the past, in clinical research . In the current state it was thought that it is not feasible to propose clear-cut criteria for assessing what constitutes good and bad basic research beyond generally accepted standards. Furthermore, a lot of basic research experiments are explorative and often procedures cannot follow a predictable outline. Research questions tend to be more complex than the methodologically straightforward question of efficacy. Unexpected effects are common, either due to the potentization process, the level of potentization or the substance itself. In exploratory experiments it is often not obvious to which part of the intervention the effect refers to and to what extent. A precise description of the manufacturing and the Pharmacopoeias of the test and control substances are therefore indispensable. If the expected effect is caused by the succussion process only, unpotentized solvent is the adequate control. If the effect is expected to be caused by a potentized substance, a potentized solvent or another potentized substance (at the same potency level) is adequate. When investigation was performed on the Similia Principle, a variety of substances, which represent different degrees of similarity with the diseased state, can be chosen as control. It is important to explain why which control was selected relating to the underlying questions of research, for example, the Similia Principle or Isopathy. In this context the role of individualization in experiments performed about the Similia Principle or the definition of Isopathy should be taken into account.
Whether quality assessment scores make sense in the future remains to be discussed. In clinical research there is a clear trend to assess single components of quality and investigate their impact on outcomes instead of using questionable summary scores. “One commonly-used scale was developed by Jadad and colleagues for randomized trials in pain research . The use of this scale is explicitly discouraged. As well as suffering from the generic problems of scales, it has a strong emphasis on reporting rather than conduct, and does not cover one of the most important potential biases in randomized trials, namely allocation concealment." (Cochrane Handbook, Chapter 8.3.3) .
REHBaR was developed with a standard consensus method among an international team of researchers with experience in basic and clinical research in homeopathy, experimental physiology, general research methodology and statistics. All members commented on intermediate and the final version of the checklist. Obviously, the team was a relatively small sample of individuals, and other researchers might have introduced other items. The current list has to be considered as a first try and it is hoped that it proves useful to enhance the quality of reporting basic research experiments in homeopathy. We encourage the use of the list, critical feedback and hope to be able to provide an improved version in a few years.
The authors are grateful to logistic support by the German Karl and Veronica Carstens-Foundation offering a platform for communication and coordination of the Delphi process. Furthermore, the foundation provided a grant for two consensus conferences.
C. M. Witt, M. Bluth, H. Albrecht, T. E. R. Weißhuhn, S. Baumgartner, and S. N. Willich, “The in vitro evidence for an effect of high homeopathic potencies—a systematic review of the literature,” Complementary Therapies in Medicine, vol. 15, no. 2, pp. 128–138, 2007.View at: Publisher Site | Google Scholar
J. Schulte, “About quality and standards in ultra high dilution research and research on homeopathy,” in Ultra High Dilution—Physiology and Physics, P. C. S. J. Endler, Ed., pp. 171–176, Kluwer Academic Publishers, Dordrecht, The Netherlands, 1994.View at: Google Scholar
S. Baumgartner, P. Heusser, and A. Thurneysen, “Methodological standards and problems in preclinical homoeopathic potency research,” Forschende Komplementarmedizin und Klassische Naturheilkunde, vol. 5, no. 1, pp. 27–32, 1998.View at: Google Scholar
K. Linde, W. B. Jonas, D. Melchart, F. Worku, H. Wagner, and F. Eitel, “Critical review and meta-analysis of serial agitated dilutions in experimental toxicology,” Human and Experimental Toxicology, vol. 13, no. 7, pp. 481–492, 1994.View at: Google Scholar
C. Becker-Witt, T. E. R. Weißhuhn, R. Lüdtke, and S. N. Willich, “Quality assessment of physical research in homeopathy,” Journal of Alternative and Complementary Medicine, vol. 9, no. 1, pp. 113–132, 2003.View at: Google Scholar
D. Moher, K. F. Schulz, and D. G. Altman, “The CONSORT statement: revised recommendations for improving the quality of reports of parallel-group randomized trials,” Journal of the American Podiatric Medical Association, vol. 91, no. 8, pp. 437–442, 2001.View at: Google Scholar
J. P. Vandenbroucke, E. Von Elm, D. G. Altman et al., “Strengthening the reporting of observational studies in epidemiology (STROBE): explanation and elaboration,” Annals of Internal Medicine, vol. 147, no. 8, pp. W163–W194, 2007.View at: Google Scholar
D. Moher, K. F. Schulz, and D. Altman, “The CONSORT statement: revised recommendations for improving the quality of reports of parallel-group randomized trials 2001,” Explore, vol. 1, no. 1, pp. 40–45, 2005.View at: Google Scholar
I. Boutron, F. Tubach, B. Giraudeau, and P. Ravaud, “Methodological differences in clinical trials evaluating nonpharmacological and pharmacological treatments of hip and knee osteoarthritis,” Journal of the American Medical Association, vol. 290, pp. 1062–1070, 2003.View at: Google Scholar
J. Y. Li, Y. F. Zhang, G. S. Smith, C. J. Xue, Y. N. Luo, and W. H. Chen, “Quality of reporting of randomized clinical trials in Tai Chi interventions—a systematic review,” Evidence-Based Complementary and Alternative Medicine. In press.View at: Google Scholar
D. Moher, D. J. Cook, S. Eastwood, I. Olkin, D. Rennie, and D. F. Stroup, “Improving the quality of reports of meta-analyses of randomised controlled trials: the QUOROM statement. Quality of reporting of meta-analyses,” Lancet, vol. 354, pp. 1896–900, 1999.View at: Google Scholar
D. L. Turpin, “CONSORT and QUOROM guidelines for reporting randomized clinical trials and systematic reviews,” American Journal of Orthodontics and Dentofacial Orthopedics, vol. 128, pp. 681–685, 2005.View at: Google Scholar
D. Moher, A. Liberati, J. Tetzlaff, and D. G. Altman, “Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement,” Journal of Clinical Epidemiology, vol. 62, pp. 1006–1012, 2009.View at: Google Scholar
J. Little, J. P. T. Higgins, J. P. A. Ioannidis et al., “Strengthening the reporting of genetic association studies (STREGA)-an extension of the strengthening the reporting of observational studies in epidemiology (STROBE) statement,” Journal of Clinical Epidemiology, vol. 62, no. 6, pp. 597–608, 2009.View at: Publisher Site | Google Scholar
S. L. Prady and H. MacPherson, “Assessing the utility of the standards for reporting trials of acupuncture (STRICTA): a survey of authors,” Journal of Alternative and Complementary Medicine, vol. 13, pp. 939–943, 2007.View at: Google Scholar
J. J. Gagnier, H. Boon, P. Rochon, D. Moher, J. Barnes, and C. Bombardier, “Reporting randomized, controlled trials of herbal interventions: an elaborated CONSORT statement,” Annals of Internal Medicine, vol. 144, no. 5, pp. 364–367, 2006.View at: Google Scholar
A. M. Whelan, T. M. Jurgens, and L. Lord, “Evaluating the quality of randomized controlled trials that examine the efficacy of natural health products: a systematic review of critical appraisal instruments,” Evidence-Based Complementary and Alternative Medicine. In press.View at: Publisher Site | Google Scholar
J. Hornung and K. Linde, “Guidelines for the exact description of the preparation and mode of application of serial dilutions and potencies in publications on ultra low dose effects and homeopathic research—a proposal,” Berlin Journal of Research in Homeopathy, vol. 1, pp. 121–123, 1991.View at: Google Scholar