|
Traditional Medicine and Complementary/Alternative Medicine Subjective scales for the evaluation of therapeutical effects and their use in complementary medicine
A. LIVERANI
E. MINELLI
A. RICCIUTI Abstract Using existing systems for appraising the efficacy of various therapies,
regardless of their typology, the authors considered about ten methods commonly
used to evaluate quality of life. They give guidelines on how to build
another index for judging quality of life taking into account the patient's
conditions of life. These indexes share similar problems in finding objective
measures for assessing personal situations without regard to patient's or
observer's subjective impressions.
Introduction Discussion about the effectiveness of alternative therapies is still
inconclusive. In the following paper "scientific" medicine will be
referred to as "conventional" to identify it from "complementary" and "alternative"
therapies.
This paper will focus on point 3.a. to examine the use of different instruments already validated (tests for evaluation of QOL) for recording changes of a patient's condition from a subjective perspective. This type of evaluation is not linked to the type of therapy used. However, it is appropriate when complementary therapies are used, with or without the conventional therapies.evaluation of the QOL, independent of the instruments used to improve it, is increasingly used for chronic-degenerative diseases, for which there are no sure treatments. This is an area where combination of conventional and complementary therapies is gaining popularity. A domain of possible observations accepted by the supporters of different medical approaches needs to be identified. WHOQOL-100 and NIH methodological manifestoDifferent resolutions made by WHO (Consultation on Acupuncture, Milan,
Nov. 1996), take the stance that, in those countries where conventional
biomedicine is the basis of medical courses, the ethical use of complementary
medicines must be backed by controlled clinical tests of its effectiveness. WHOQOL-100
Fig 1 - Classification of the domains related to the quality of life (Division of Mental Health WHO. Field Trial WHOQOL-100, Facet Definitions and Questions. MNH/PSF/95.1.B, Geneva, Switzerland, 1995) Conversely, some groups of scientists, even if skeptical of RCTs and
"reductionist" approaches, think that alternative therapies are not so unusual
and can be studied with existing methodologies. A strong position
was enunciated by the Working Group on Quantitative Methods in Alternative
Medicine of the NIH (Levin et al., 1997), which resulted in a kind of Methodological
Manifesto, the aim of which is to focus on problems and solutions and guidelines
for the evaluation of CAM (Complementary and Alternative Medicine). In this
document, in contrast to statements from many researchers and doctors involved
in CAM, the validity of existing research methodology and analytical procedures
is underlined, while also comparing the non-conventionality of the hypotheses
and of the therapy conclusions of CAM. The Methodological Manifesto The principal methodological problems reflect the opinio of many non-conventional practitioners, that their therapies or proposed mechanisms are so unusual they cannot be evaluated or investigated with conventional or existing tools. These can be summarised as follows:
The Methodological Manifesto consists of seven guidelines and recommendations completed by the Working Group on Quantitative Methods in Alternative Medicine of the NIH.
Finally, the Working Group on Quantitative Methods in Alternative Medicine emphasized that randomization is an important principle for ensuring that studies and their results are representative and generalizable. Weakness of controlled randomized clinical trials The randomized clinical trials (RCT) is the principle tool of clinical
research and to document the effect (positive or negative) of a therapy
we rely on the work of Theodore Pincus (1997). It must be observed
that the RCT is based on the standard medical record. One inadequacy of
this approach is its inability to record professional "acta" more than their
utility or failure. Additionally, it is not designed to evaluate the effectiveness
of a long term treatment. These limits can be linked to the evolution
of the medical record within the "biomedical model", the fundamental paradigm
of medicine of the XX century (Engel, 1977, Holman, 1976, Sagan, 1987). The rise of controlled randomized clinical trials To overcome these limitations the randomised controlled clinical trials, designed and developed in the early '50s (Daniels & Hill, 1952), are regarded the optimal method for the evaluation of the effectiveness of any intervention. RCTs are effective in evaluating treatments in the acute phases of diseases that evolve in short periods (with easily identifiable end-points) and for documenting side effects of drugs. However, for the predominant problems of our time, the study of chronic diseases, characterised by problems that evolve in the medium and long term, where end-points are not easily identifiable, and that need "control", "care" and "palliation", the RCT method is not adequate. The rigid and limiting procedures imposed by the theoretical criteria underlying these studies can, in some cases, prevent or compromise the patient's long term optimal treatment. For example, studies on patients affected by rheumatoid arthritis, showed that drugs considered effective in treating this pathology in the short run, but with high toxicity and secondary effects (metotrexate, D-penicillamine, sulfasalasine, ect) in some cases lead to interruption of the treatment, have the same, if not lowered, effectiveness in the long term as drugs considered to be less effective in short term clinical trials. Limits of controlled randomised clinical trails The limitations of the RCTs may be grouped into two categories:
Six pragmatic limitations of randomised clinical trails. 1. Exclusion criteria in randomized trials In all randomized clinical trials, there are exclusion criteria that restrict the selection of certain patients to minimize the interference of factors that can bias the results. It follows that many interventions are tested on a minority of patients. The exclusion criteria are defined by the researcher and may be exigent (high number of exclusion criteria) or practical (low number of exclusion criteria). 2. The relatively short observation period in most of the clinical trials of chronic diseases In acute diseases the observation period of a few months may be sufficient to recognise the effectiveness of the treatment and to identify possible secondary effects. However, in chronic diseases, a few months, one year or few years (the observations periods for most clinical trials) is to short too evaluate fully the effectiveness of the treatment. 3. Standard dosages imposed by protocols Most randomized controlled trials establish rigid doses to guarantee "scientific" criteria. A patient who reports drug efficacy but severe gastrointestinal distress might be withdrawn from the trial, even if, in everyday practice, the dosage might be reduced and therapy continued effectively without secondary effects. In a trial the idea of personalized therapy, the maximum possible dosage for a particular patient and not the maximum possible dosage is ignored. Therefore, rigidly designed clinical trials risk underestimating the effectiveness of the drug and overestimating the toxicity. The clinical behavior imposed by rigid protocols does not reflect everyday clinical practice. 4. The use of markers that in several cases are not appropriate for tracing the evolution of the pathology In one trial of one or a few years it is not possible to evaluate the long term outcomes of a chronic disease, like death or disability. Often the markers used in trials for chronic diseases are not suitable and reliable for tracing the future of the pathology (for example the carcinoembryonic antigen [CEA] test), and therefore are not a good guide for reaching the long term objectives of the medical treatment. 5. Significant statistical results are not necessarily clinically important and viceversa. The effectiveness of a clinical trial is evaluated by the statistical significance of the difference of results of the studied treatment versus placebo or different treatments. However, significant statistical results may not correspond to a clinical point of view (for example whwn any improvement of a physical parameter goes on with a reduction of quality of life) . Moreover, the need for large numbers in order to reach statistical significance implies problems for rare pathologies. 6. Tendency to ignore important factors not previewed in the research but that affect the results In many clinical trials, important factors linked to patients bias the results more than the treatment (or of the placebo) for which the patients where randomised (many examples cn be found in researches on antihyperlipidemia dugs that do not consider the nutritional habits of the patient). However, this information is generally ignored or not emphasised. Four intrinsic considerations. 1. The design of a clinical trial may influence the results despite the inclusion of a control group The design of an RCT may considerably shift the probability that an intervention will or will not appear to be more effective than a placebo. The RCT is not by definition neutral and the existence of a control group does not necessarily eliminate biases which are intrinsic in the design of a study 2. Clinical trials results generally refer to groups of patients and ignore the variability between individuals Consider a typical trial in which 60% of individuals respond more favourably to drug A than to drug B, 20% respond more favourably to drug B, and 20% find both drugs of equal effectiveness. The interpretation of the results is that drug A is superior to drug B for all patients, although a more precise interpretation might be that drug A is superior to drug B for certain individuals while the opposite is true for others.Many hospital stock only one single drug due to economic constraints, regardless of the fact that each drug is effective only for certain individuals. 3. Interpretation of side effects is not standardised and introduces biases into results of clinical trials 4. The design of clinical trials distorts the placebo effect by informing the patients that they are under study and may not receive the "best" therapy Patients who participate in clinical trials are fewer than 0.001% of all patients treated by drugs (Pincus, 1997). The exclusion criteria usually mean that the sample under investigation is not representative of the universe of patients with the disease nor of patients under treatment. Moreover, costs of clinical trials cannot easily be surmounted. It is important to emphasise the value of clinical trials in documenting long term safety, or the absence of safety, in contrast to effectiveness. Weaknesses of meta-analysis modelsThe critique of RCTs and of recent statistic models led to the development
of meta-analytic models. Meta-analysis can reach significant results when
different sampling is carried out, for example cross-country sampling grouped
to one particular research variable. Meta-analysis was designed as a tool
to link non-coordinated research on similar topics carried out in different
places. The effectiveness of the result is arguable as the newly elaborated
indices tend to lose specificity. Patient Self-Report Questionnaires The same difficulties can be overcome by using patients self report-questionnaires
(PSRQ) to monitor pathology and long term effectiveness during each meeting
with the patient. They may also be helpful to the practitioner at the intellectual,
professional and ethical level. These questionnaires can be used to evaluate
the effectiveness of treatments in conventional, complementary or
integrated medicine, and are an adequate tool to evaluate the general effectiveness
of the therapeutic treatment on the patient.
A questionnaire of this kind could be a useful tool to control the quality of professional activity. The costs of implementing such a system of questionnaires is approximately 1-3% of the total costs of treatment. This cost is justified in terms of the practitioner professional ethics and responsibility toward the patient and toward himself. Comparative analysis of subjective scales When comparing different subjective scales, the researchers considered the following categories, for giving different weight to the information collected:
The large number and the typology of subjective scales for clinical use is well known. It is basically an attempt to codify the impression of the practitioner and/or of the patient toward the treatment ie to describe a real situation. It is therefore quite hard to collect all the material produced unless limiting the research, for example, to one specific pathology. This is what has been done by on the work of M. Tamburini in evaluating cancer treatments. The following methods have been evaluated. KPS: the Karnofsky Performance Status Scale, proposed in 1948, is the best known of the tests in use. Even if it is not used for measuring functional integrity, the symptoms and/or the signs of disease and the toxicity of the treatment can be assessed but, more generally, the quality of life. The KPS uses a method of evaluation in which the maximum score corresponds to the normal status of health. QL index: The quality of life index proposed by Spitzer in 1981, introduces a global measure of quality of life for cancer patients. The scales that are used in five activity areas have been criticized in practice. However, the QL index has the merit of proposing a mediate evaluation and therefore relates well to patient's status connected to quality of life. RSCL: The Rotterdam Symptom Checklist is the most recent, developed in 1990 (De Haels et Al.), as a tool to measure globally the quality of life for cancer patients. The RSCL is a very innovative instrument and is still under study for some methodological and administrative aspects. SCI: The SCI proposed by Tamburini in 1991 has the objective of measuring pain during chemotherapy and is based on the idea of counting the number of treatment days of pain. The method builds on methods of similar questionnaires and is easy to complete. SDS: The Symptom Distress Scale (McCorkle 1978) aims to evaluate the degree of suffering caused by cancer patients' symptoms according to 13 linked aspects evaluated on a five point ordinal scale. The test demonstrates the problem of patient's learning process that can bias the data when repeated TIQ: The therapy Impact Quality (Ventafridda et Al 1990), conceived for evaluating the effects of therapy for terminal patients, evaluates the quality of life in four domains using 36 items plus 1 general item. The test can be used by non specialized personnel. VAS/LASA: The VAS/ Linear Analogue Self Assessment (Scott et Al. 1976, Priestman et Al 1976, Carlsson 1983, Jensen et Al 1986) are general tools, well established for medical and psychological parameters. A great amount of research has been done to evaluate and compare this tool to other analytical instruments (verbal and numerical scales). It is easy to observe that the different methodological approaches may be grouped into three types:
The first category can be used in all pain situations, as it is not specific. However, it can strongly depend on the type of observed subject and psychic component. The second category instruments shows that to reach some result linked to the therapy the inventory must be repeated. The third class methods try to identify some objective measure of distress and to compare the state of two patients. Weaknesses common to all include:
Weakness of subjective tools Spurious objectivity masking environmental and psychological conditions cannot be eliminated. It is the underlying structural weakness of all these observation methods. The only recourse is to define and limit as well as possible the purpose of the observation and of the research. The use of ordinal scales aims to overcome the risks of imprecision in questionnaires by use of binary answers. Two approaches are used: a) the patient is left free to choose within a well specified domain, b) the patient is given the choice between three or more options defined a priori. Without focusing on the statistical implications, the apparently greater precision of the first alternative rapidly vanishes when the effects implied by such a choice make it difficult to compare the choices of more individuals. Moreover, in the second alternative the discussion is on the number and scaling of the choices from which the patient has to select. Too many choices leads to low comparison, few choices result in poor information. Moreover, imprecise definition of the choices leads to loss of information. These investigation tools are worthy of further study, even if some points of weakness exist, when an effort is made to identify binary answer questionnaires. That is where the investigated situation are identified with the greatest precision and to the proposed questions, even if numerous, a yes / no answer can be given. References
Edited by Aldo Campana, |