U.S. flag

An official website of the United States government

The .gov means it’s official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

  • Publications
  • Account settings

Preview improvements coming to the PMC website in October 2024. Learn More or Try it out now .

  • Advanced Search
  • Journal List
  • J Family Med Prim Care
  • v.4(3); Jul-Sep 2015

Validity, reliability, and generalizability in qualitative research

Lawrence leung.

1 Department of Family Medicine, Queen's University, Kingston, Ontario, Canada

2 Centre of Studies in Primary Care, Queen's University, Kingston, Ontario, Canada

In general practice, qualitative research contributes as significantly as quantitative research, in particular regarding psycho-social aspects of patient-care, health services provision, policy setting, and health administrations. In contrast to quantitative research, qualitative research as a whole has been constantly critiqued, if not disparaged, by the lack of consensus for assessing its quality and robustness. This article illustrates with five published studies how qualitative research can impact and reshape the discipline of primary care, spiraling out from clinic-based health screening to community-based disease monitoring, evaluation of out-of-hours triage services to provincial psychiatric care pathways model and finally, national legislation of core measures for children's healthcare insurance. Fundamental concepts of validity, reliability, and generalizability as applicable to qualitative research are then addressed with an update on the current views and controversies.

Nature of Qualitative Research versus Quantitative Research

The essence of qualitative research is to make sense of and recognize patterns among words in order to build up a meaningful picture without compromising its richness and dimensionality. Like quantitative research, the qualitative research aims to seek answers for questions of “how, where, when who and why” with a perspective to build a theory or refute an existing theory. Unlike quantitative research which deals primarily with numerical data and their statistical interpretations under a reductionist, logical and strictly objective paradigm, qualitative research handles nonnumerical information and their phenomenological interpretation, which inextricably tie in with human senses and subjectivity. While human emotions and perspectives from both subjects and researchers are considered undesirable biases confounding results in quantitative research, the same elements are considered essential and inevitable, if not treasurable, in qualitative research as they invariable add extra dimensions and colors to enrich the corpus of findings. However, the issue of subjectivity and contextual ramifications has fueled incessant controversies regarding yardsticks for quality and trustworthiness of qualitative research results for healthcare.

Impact of Qualitative Research upon Primary Care

In many ways, qualitative research contributes significantly, if not more so than quantitative research, to the field of primary care at various levels. Five qualitative studies are chosen to illustrate how various methodologies of qualitative research helped in advancing primary healthcare, from novel monitoring of chronic obstructive pulmonary disease (COPD) via mobile-health technology,[ 1 ] informed decision for colorectal cancer screening,[ 2 ] triaging out-of-hours GP services,[ 3 ] evaluating care pathways for community psychiatry[ 4 ] and finally prioritization of healthcare initiatives for legislation purposes at national levels.[ 5 ] With the recent advances of information technology and mobile connecting device, self-monitoring and management of chronic diseases via tele-health technology may seem beneficial to both the patient and healthcare provider. Recruiting COPD patients who were given tele-health devices that monitored lung functions, Williams et al. [ 1 ] conducted phone interviews and analyzed their transcripts via a grounded theory approach, identified themes which enabled them to conclude that such mobile-health setup and application helped to engage patients with better adherence to treatment and overall improvement in mood. Such positive findings were in contrast to previous studies, which opined that elderly patients were often challenged by operating computer tablets,[ 6 ] or, conversing with the tele-health software.[ 7 ] To explore the content of recommendations for colorectal cancer screening given out by family physicians, Wackerbarth, et al. [ 2 ] conducted semi-structure interviews with subsequent content analysis and found that most physicians delivered information to enrich patient knowledge with little regard to patients’ true understanding, ideas, and preferences in the matter. These findings suggested room for improvement for family physicians to better engage their patients in recommending preventative care. Faced with various models of out-of-hours triage services for GP consultations, Egbunike et al. [ 3 ] conducted thematic analysis on semi-structured telephone interviews with patients and doctors in various urban, rural and mixed settings. They found that the efficiency of triage services remained a prime concern from both users and providers, among issues of access to doctors and unfulfilled/mismatched expectations from users, which could arouse dissatisfaction and legal implications. In UK, a care pathways model for community psychiatry had been introduced but its benefits were unclear. Khandaker et al. [ 4 ] hence conducted a qualitative study using semi-structure interviews with medical staff and other stakeholders; adopting a grounded-theory approach, major themes emerged which included improved equality of access, more focused logistics, increased work throughput and better accountability for community psychiatry provided under the care pathway model. Finally, at the US national level, Mangione-Smith et al. [ 5 ] employed a modified Delphi method to gather consensus from a panel of nominators which were recognized experts and stakeholders in their disciplines, and identified a core set of quality measures for children's healthcare under the Medicaid and Children's Health Insurance Program. These core measures were made transparent for public opinion and later passed on for full legislation, hence illustrating the impact of qualitative research upon social welfare and policy improvement.

Overall Criteria for Quality in Qualitative Research

Given the diverse genera and forms of qualitative research, there is no consensus for assessing any piece of qualitative research work. Various approaches have been suggested, the two leading schools of thoughts being the school of Dixon-Woods et al. [ 8 ] which emphasizes on methodology, and that of Lincoln et al. [ 9 ] which stresses the rigor of interpretation of results. By identifying commonalities of qualitative research, Dixon-Woods produced a checklist of questions for assessing clarity and appropriateness of the research question; the description and appropriateness for sampling, data collection and data analysis; levels of support and evidence for claims; coherence between data, interpretation and conclusions, and finally level of contribution of the paper. These criteria foster the 10 questions for the Critical Appraisal Skills Program checklist for qualitative studies.[ 10 ] However, these methodology-weighted criteria may not do justice to qualitative studies that differ in epistemological and philosophical paradigms,[ 11 , 12 ] one classic example will be positivistic versus interpretivistic.[ 13 ] Equally, without a robust methodological layout, rigorous interpretation of results advocated by Lincoln et al. [ 9 ] will not be good either. Meyrick[ 14 ] argued from a different angle and proposed fulfillment of the dual core criteria of “transparency” and “systematicity” for good quality qualitative research. In brief, every step of the research logistics (from theory formation, design of study, sampling, data acquisition and analysis to results and conclusions) has to be validated if it is transparent or systematic enough. In this manner, both the research process and results can be assured of high rigor and robustness.[ 14 ] Finally, Kitto et al. [ 15 ] epitomized six criteria for assessing overall quality of qualitative research: (i) Clarification and justification, (ii) procedural rigor, (iii) sample representativeness, (iv) interpretative rigor, (v) reflexive and evaluative rigor and (vi) transferability/generalizability, which also double as evaluative landmarks for manuscript review to the Medical Journal of Australia. Same for quantitative research, quality for qualitative research can be assessed in terms of validity, reliability, and generalizability.

Validity in qualitative research means “appropriateness” of the tools, processes, and data. Whether the research question is valid for the desired outcome, the choice of methodology is appropriate for answering the research question, the design is valid for the methodology, the sampling and data analysis is appropriate, and finally the results and conclusions are valid for the sample and context. In assessing validity of qualitative research, the challenge can start from the ontology and epistemology of the issue being studied, e.g. the concept of “individual” is seen differently between humanistic and positive psychologists due to differing philosophical perspectives:[ 16 ] Where humanistic psychologists believe “individual” is a product of existential awareness and social interaction, positive psychologists think the “individual” exists side-by-side with formation of any human being. Set off in different pathways, qualitative research regarding the individual's wellbeing will be concluded with varying validity. Choice of methodology must enable detection of findings/phenomena in the appropriate context for it to be valid, with due regard to culturally and contextually variable. For sampling, procedures and methods must be appropriate for the research paradigm and be distinctive between systematic,[ 17 ] purposeful[ 18 ] or theoretical (adaptive) sampling[ 19 , 20 ] where the systematic sampling has no a priori theory, purposeful sampling often has a certain aim or framework and theoretical sampling is molded by the ongoing process of data collection and theory in evolution. For data extraction and analysis, several methods were adopted to enhance validity, including 1 st tier triangulation (of researchers) and 2 nd tier triangulation (of resources and theories),[ 17 , 21 ] well-documented audit trail of materials and processes,[ 22 , 23 , 24 ] multidimensional analysis as concept- or case-orientated[ 25 , 26 ] and respondent verification.[ 21 , 27 ]

Reliability

In quantitative research, reliability refers to exact replicability of the processes and the results. In qualitative research with diverse paradigms, such definition of reliability is challenging and epistemologically counter-intuitive. Hence, the essence of reliability for qualitative research lies with consistency.[ 24 , 28 ] A margin of variability for results is tolerated in qualitative research provided the methodology and epistemological logistics consistently yield data that are ontologically similar but may differ in richness and ambience within similar dimensions. Silverman[ 29 ] proposed five approaches in enhancing the reliability of process and results: Refutational analysis, constant data comparison, comprehensive data use, inclusive of the deviant case and use of tables. As data were extracted from the original sources, researchers must verify their accuracy in terms of form and context with constant comparison,[ 27 ] either alone or with peers (a form of triangulation).[ 30 ] The scope and analysis of data included should be as comprehensive and inclusive with reference to quantitative aspects if possible.[ 30 ] Adopting the Popperian dictum of falsifiability as essence of truth and science, attempted to refute the qualitative data and analytes should be performed to assess reliability.[ 31 ]

Generalizability

Most qualitative research studies, if not all, are meant to study a specific issue or phenomenon in a certain population or ethnic group, of a focused locality in a particular context, hence generalizability of qualitative research findings is usually not an expected attribute. However, with rising trend of knowledge synthesis from qualitative research via meta-synthesis, meta-narrative or meta-ethnography, evaluation of generalizability becomes pertinent. A pragmatic approach to assessing generalizability for qualitative studies is to adopt same criteria for validity: That is, use of systematic sampling, triangulation and constant comparison, proper audit and documentation, and multi-dimensional theory.[ 17 ] However, some researchers espouse the approach of analytical generalization[ 32 ] where one judges the extent to which the findings in one study can be generalized to another under similar theoretical, and the proximal similarity model, where generalizability of one study to another is judged by similarities between the time, place, people and other social contexts.[ 33 ] Thus said, Zimmer[ 34 ] questioned the suitability of meta-synthesis in view of the basic tenets of grounded theory,[ 35 ] phenomenology[ 36 ] and ethnography.[ 37 ] He concluded that any valid meta-synthesis must retain the other two goals of theory development and higher-level abstraction while in search of generalizability, and must be executed as a third level interpretation using Gadamer's concepts of the hermeneutic circle,[ 38 , 39 ] dialogic process[ 38 ] and fusion of horizons.[ 39 ] Finally, Toye et al. [ 40 ] reported the practicality of using “conceptual clarity” and “interpretative rigor” as intuitive criteria for assessing quality in meta-ethnography, which somehow echoed Rolfe's controversial aesthetic theory of research reports.[ 41 ]

Food for Thought

Despite various measures to enhance or ensure quality of qualitative studies, some researchers opined from a purist ontological and epistemological angle that qualitative research is not a unified, but ipso facto diverse field,[ 8 ] hence any attempt to synthesize or appraise different studies under one system is impossible and conceptually wrong. Barbour argued from a philosophical angle that these special measures or “technical fixes” (like purposive sampling, multiple-coding, triangulation, and respondent validation) can never confer the rigor as conceived.[ 11 ] In extremis, Rolfe et al. opined from the field of nursing research, that any set of formal criteria used to judge the quality of qualitative research are futile and without validity, and suggested that any qualitative report should be judged by the form it is written (aesthetic) and not by the contents (epistemic).[ 41 ] Rolfe's novel view is rebutted by Porter,[ 42 ] who argued via logical premises that two of Rolfe's fundamental statements were flawed: (i) “The content of research report is determined by their forms” may not be a fact, and (ii) that research appraisal being “subject to individual judgment based on insight and experience” will mean those without sufficient experience of performing research will be unable to judge adequately – hence an elitist's principle. From a realism standpoint, Porter then proposes multiple and open approaches for validity in qualitative research that incorporate parallel perspectives[ 43 , 44 ] and diversification of meanings.[ 44 ] Any work of qualitative research, when read by the readers, is always a two-way interactive process, such that validity and quality has to be judged by the receiving end too and not by the researcher end alone.

In summary, the three gold criteria of validity, reliability and generalizability apply in principle to assess quality for both quantitative and qualitative research, what differs will be the nature and type of processes that ontologically and epistemologically distinguish between the two.

Source of Support: Nil.

Conflict of Interest: None declared.

U.S. flag

An official website of the United States government

The .gov means it’s official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

  • Publications
  • Account settings
  • My Bibliography
  • Collections
  • Citation manager

Save citation to file

Email citation, add to collections.

  • Create a new collection
  • Add to an existing collection

Add to My Bibliography

Your saved search, create a file for external citation management software, your rss feed.

  • Search in PubMed
  • Search in NLM Catalog
  • Add to Search

Validity, reliability, and generalizability in qualitative research

Affiliation.

  • 1 Department of Family Medicine, Queen's University, Kingston, Ontario, Canada ; Centre of Studies in Primary Care, Queen's University, Kingston, Ontario, Canada.
  • PMID: 26288766
  • PMCID: PMC4535087
  • DOI: 10.4103/2249-4863.161306

In general practice, qualitative research contributes as significantly as quantitative research, in particular regarding psycho-social aspects of patient-care, health services provision, policy setting, and health administrations. In contrast to quantitative research, qualitative research as a whole has been constantly critiqued, if not disparaged, by the lack of consensus for assessing its quality and robustness. This article illustrates with five published studies how qualitative research can impact and reshape the discipline of primary care, spiraling out from clinic-based health screening to community-based disease monitoring, evaluation of out-of-hours triage services to provincial psychiatric care pathways model and finally, national legislation of core measures for children's healthcare insurance. Fundamental concepts of validity, reliability, and generalizability as applicable to qualitative research are then addressed with an update on the current views and controversies.

Keywords: Controversies; generalizability; primary care research; qualitative research; reliability; validity.

PubMed Disclaimer

Conflict of interest statement

Conflict of Interest: None declared.

Similar articles

  • Police-related triage interventions for mental health-related incidents: a rapid evidence synthesis. Rodgers M, Thomas S, Dalton J, Harden M, Eastwood A. Rodgers M, et al. Southampton (UK): NIHR Journals Library; 2019 May. Southampton (UK): NIHR Journals Library; 2019 May. PMID: 31162918 Free Books & Documents. Review.
  • IMPRoving Outcomes for children exposed to domestic ViolencE (IMPROVE): an evidence synthesis. Howarth E, Moore THM, Welton NJ, Lewis N, Stanley N, MacMillan H, Shaw A, Hester M, Bryden P, Feder G. Howarth E, et al. Southampton (UK): NIHR Journals Library; 2016 Dec. Southampton (UK): NIHR Journals Library; 2016 Dec. PMID: 27977089 Free Books & Documents. Review.
  • The Children's Health Insurance Program Reauthorization Act quality measures initiatives: moving forward to improve measurement, care, and child and adolescent outcomes. Dougherty D, Schiff J, Mangione-Smith R. Dougherty D, et al. Acad Pediatr. 2011 May-Jun;11(3 Suppl):S1-S10. doi: 10.1016/j.acap.2011.02.009. Acad Pediatr. 2011. PMID: 21570012
  • [Reconsidering evaluation criteria regarding health care research: toward an integrative framework of quantitative and qualitative criteria]. Miyata H, Kai I. Miyata H, et al. Nihon Koshu Eisei Zasshi. 2006 May;53(5):319-28. Nihon Koshu Eisei Zasshi. 2006. PMID: 16813064 Japanese.
  • Eliciting public preferences for healthcare: a systematic review of techniques. Ryan M, Scott DA, Reeves C, Bate A, van Teijlingen ER, Russell EM, Napper M, Robb CM. Ryan M, et al. Health Technol Assess. 2001;5(5):1-186. doi: 10.3310/hta5050. Health Technol Assess. 2001. PMID: 11262422
  • The achievement of pre-operative expectations in patients undergoing knee arthroplasty: a cohort study evaluating unique patient goals. Karunaratne S, Harris IA, Horsley M, Trevena L, Solomon M. Karunaratne S, et al. J Patient Rep Outcomes. 2024 Jun 6;8(1):56. doi: 10.1186/s41687-024-00734-8. J Patient Rep Outcomes. 2024. PMID: 38842595 Free PMC article.
  • Evaluation of lived experience Peer Support intervention for mental health service consumers in Primary Care (PS-PC): study protocol for a stepped-wedge cluster randomised controlled trial. Lawn S, Shelby-James T, Manger S, Byrne L, Fuss B, Isaac V, Kaambwa B, Ullah S, Rattray M, Gye B, Kaine C, Phegan C, Harris G, Worley P. Lawn S, et al. Trials. 2024 May 14;25(1):319. doi: 10.1186/s13063-024-08165-y. Trials. 2024. PMID: 38745299 Free PMC article.
  • Factors Influencing the Implementation of Infant Warming Devices Among Healthcare Workers in Malawian Hospitals. Nyondo-Mipando AL, Woo Kinshella ML, Salimu S, Chiwaya B, Chikoti F, Chirambo L, Mwaungulu E, Banda M, Hiwa T, Vidler M, Molyneux EM, Dube Q, Mfutso-Bengo J, Goldfarb DM, Kawaza K. Nyondo-Mipando AL, et al. Glob Pediatr Health. 2024 Apr 30;11:2333794X241248982. doi: 10.1177/2333794X241248982. eCollection 2024. Glob Pediatr Health. 2024. PMID: 38694563 Free PMC article.
  • An Exploration of Common Dyadic Coping Strategies: A Perspective from Pakistani Couples Living with Chronic Conditions. Arshad M, Bibi B. Arshad M, et al. J Relig Health. 2024 May 1. doi: 10.1007/s10943-024-02037-0. Online ahead of print. J Relig Health. 2024. PMID: 38691261
  • Training the next generation of community-engaged physicians: a mixed-methods evaluation of a novel course for medical service learning in the COVID-19 era. Scala JJ, Cha H, Shamardani K, Rashes ER, Acosta-Alvarez L, Mediratta RP. Scala JJ, et al. BMC Med Educ. 2024 Apr 22;24(1):426. doi: 10.1186/s12909-024-05372-8. BMC Med Educ. 2024. PMID: 38649984 Free PMC article.
  • Williams V, Price J, Hardinge M, Tarassenko L, Farmer A. Using a mobile health application to support self-management in COPD: A qualitative study. Br J Gen Pract. 2014;64:e392–400. - PMC - PubMed
  • Wackerbarth SB, Tarasenko YN, Joyce JM, Haist SA. Physician colorectal cancer screening recommendations: An examination based on informed decision making. Patient Educ Couns. 2007;66:43–50. - PMC - PubMed
  • Egbunike JN, Shaw C, Porter A, Button LA, Kinnersley P, Hood K, et al. Streamline triage and manage user expectations: Lessons from a qualitative study of GP out-of-hours services. Br J Gen Pract. 2010;60:e83–97. - PMC - PubMed
  • Khandaker GM, Gandamaneni PK, Dibben CR, Cherukuru S, Cairns P, Ray MK. Evaluating care pathways for community psychiatry in England: A qualitative study. J Eval Clin Pract. 2013;19:298–303. - PMC - PubMed
  • Mangione-Smith R, Schiff J, Dougherty D. Identifying children's health care quality measures for Medicaid and CHIP: An evidence-informed, publicly transparent expert process. Acad Pediatr. 2011;11:S11–21. - PubMed

Related information

  • Cited in Books

LinkOut - more resources

Full text sources.

  • Europe PubMed Central
  • Medknow Publications and Media Pvt Ltd
  • Ovid Technologies, Inc.
  • PubMed Central

Other Literature Sources

  • scite Smart Citations
  • Citation Manager

NCBI Literature Resources

MeSH PMC Bookshelf Disclaimer

The PubMed wordmark and PubMed logo are registered trademarks of the U.S. Department of Health and Human Services (HHS). Unauthorized use of these marks is strictly prohibited.

Log in using your username and password

  • Search More Search for this keyword Advanced search
  • Latest content
  • Current issue
  • Write for Us
  • BMJ Journals

You are here

  • Volume 18, Issue 2
  • Issues of validity and reliability in qualitative research
  • Article Text
  • Article info
  • Citation Tools
  • Rapid Responses
  • Article metrics

Download PDF

  • Helen Noble 1 ,
  • Joanna Smith 2
  • 1 School of Nursing and Midwifery, Queens's University Belfast , Belfast , UK
  • 2 School of Human and Health Sciences, University of Huddersfield , Huddersfield , UK
  • Correspondence to Dr Helen Noble School of Nursing and Midwifery, Queens's University Belfast, Medical Biology Centre, 97 Lisburn Rd, Belfast BT9 7BL, UK; helen.noble{at}qub.ac.uk

https://doi.org/10.1136/eb-2015-102054

Statistics from Altmetric.com

Request permissions.

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Evaluating the quality of research is essential if findings are to be utilised in practice and incorporated into care delivery. In a previous article we explored ‘bias’ across research designs and outlined strategies to minimise bias. 1 The aim of this article is to further outline rigour, or the integrity in which a study is conducted, and ensure the credibility of findings in relation to qualitative research. Concepts such as reliability, validity and generalisability typically associated with quantitative research and alternative terminology will be compared in relation to their application to qualitative research. In addition, some of the strategies adopted by qualitative researchers to enhance the credibility of their research are outlined.

Are the terms reliability and validity relevant to ensuring credibility in qualitative research?

Although the tests and measures used to establish the validity and reliability of quantitative research cannot be applied to qualitative research, there are ongoing debates about whether terms such as validity, reliability and generalisability are appropriate to evaluate qualitative research. 2–4 In the broadest context these terms are applicable, with validity referring to the integrity and application of the methods undertaken and the precision in which the findings accurately reflect the data, while reliability describes consistency within the employed analytical procedures. 4 However, if qualitative methods are inherently different from quantitative methods in terms of philosophical positions and purpose, then alterative frameworks for establishing rigour are appropriate. 3 Lincoln and Guba 5 offer alternative criteria for demonstrating rigour within qualitative research namely truth value, consistency and neutrality and applicability. Table 1 outlines the differences in terminology and criteria used to evaluate qualitative research.

  • View inline

Terminology and criteria used to evaluate the credibility of research findings

What strategies can qualitative researchers adopt to ensure the credibility of the study findings?

Unlike quantitative researchers, who apply statistical methods for establishing validity and reliability of research findings, qualitative researchers aim to design and incorporate methodological strategies to ensure the ‘trustworthiness’ of the findings. Such strategies include:

Accounting for personal biases which may have influenced findings; 6

Acknowledging biases in sampling and ongoing critical reflection of methods to ensure sufficient depth and relevance of data collection and analysis; 3

Meticulous record keeping, demonstrating a clear decision trail and ensuring interpretations of data are consistent and transparent; 3 , 4

Establishing a comparison case/seeking out similarities and differences across accounts to ensure different perspectives are represented; 6 , 7

Including rich and thick verbatim descriptions of participants’ accounts to support findings; 7

Demonstrating clarity in terms of thought processes during data analysis and subsequent interpretations 3 ;

Engaging with other researchers to reduce research bias; 3

Respondent validation: includes inviting participants to comment on the interview transcript and whether the final themes and concepts created adequately reflect the phenomena being investigated; 4

Data triangulation, 3 , 4 whereby different methods and perspectives help produce a more comprehensive set of findings. 8 , 9

Table 2 provides some specific examples of how some of these strategies were utilised to ensure rigour in a study that explored the impact of being a family carer to patients with stage 5 chronic kidney disease managed without dialysis. 10

Strategies for enhancing the credibility of qualitative research

In summary, it is imperative that all qualitative researchers incorporate strategies to enhance the credibility of a study during research design and implementation. Although there is no universally accepted terminology and criteria used to evaluate qualitative research, we have briefly outlined some of the strategies that can enhance the credibility of study findings.

  • Sandelowski M
  • Lincoln YS ,
  • Barrett M ,
  • Mayan M , et al
  • Greenhalgh T
  • Lingard L ,

Twitter Follow Joanna Smith at @josmith175 and Helen Noble at @helnoble

Competing interests None.

Read the full text or download the PDF:

Europe PMC requires Javascript to function effectively.

Either your web browser doesn't support Javascript or it is currently turned off. In the latter case, please turn on Javascript support in your web browser and reload this page.

  • Corpus ID: 145509060

Understanding Reliability and Validity in Qualitative Research

  • Nahid Golafshani
  • Published 1 December 2003
  • Sociology, Education
  • The Qualitative Report

4,111 Citations

Reliability and validity of qualitative and operational research paradigm, the qualitative report the qualitative report, quality in qualitative organizational research: types of triangulation as a methodological alternative, quality in qualitative studies: the case of validity, reliability and generalizability.

  • Highly Influenced

The Qualitative Research Process: How to Conduct a Qualitative Research?

  • 24 Excerpts

Sashaying Across Party Lines: Evidence of and Arguments for the Use of Validity Evidence in Qualitative Education Research

  • 11 Excerpts

Quantitative Methods for Estimating the Reliability of Qualitative Data

Ethics and validity as core issues in qualitative research: a review paper, generalizability in qualitative research: a tale of two traditions, the use of triangulation in social sciences research : can qualitative and quantitative methods be combined, 33 references, examining the validity structure of qualitative research.

  • Highly Influential

Comprehensive criteria to judge validity and reliability of qualitative research within the realism paradigm

A comparative discussion of the notion of 'validity' in qualitative and quantitative research, reliability and validity in qualitative research, mixing qualitative methods: quality assurance or qualitative quagmire, qualitative research and the question of rigor, choosing qualitative research: a primer for technology education researchers, qualitative research for education : an introduction to theory and methods / by robert c. bogdan and sari knopp biklen, the landscape of qualitative research : theories and issues, introduction to educational research, related papers.

Showing 1 through 3 of 0 Related Papers

validity reliability and generalizability in qualitative research pdf

  • Get new issue alerts Get alerts
  • Submit a Manuscript

Secondary Logo

Journal logo.

Colleague's E-mail is Invalid

Your message has been successfully sent to your colleague.

Save my selection

Validity, reliability, and generalizability in qualitative research

Leung, Lawrence 1,2,

1 Department of Family Medicine, Queen's University, Kingston, Ontario, Canada

2 Centre of Studies in Primary Care, Queen's University, Kingston, Ontario, Canada

Address for correspondence: Prof. Lawrence Leung, Centre of Studies in Primary Care, Queen's University, 220 Bagot Street, Kingston, ON K7L 5E9, Canada. E-mail: [email protected]

In general practice, qualitative research contributes as significantly as quantitative research, in particular regarding psycho-social aspects of patient-care, health services provision, policy setting, and health administrations. In contrast to quantitative research, qualitative research as a whole has been constantly critiqued, if not disparaged, by the lack of consensus for assessing its quality and robustness. This article illustrates with five published studies how qualitative research can impact and reshape the discipline of primary care, spiraling out from clinic-based health screening to community-based disease monitoring, evaluation of out-of-hours triage services to provincial psychiatric care pathways model and finally, national legislation of core measures for children's healthcare insurance. Fundamental concepts of validity, reliability, and generalizability as applicable to qualitative research are then addressed with an update on the current views and controversies.

Nature of Qualitative Research versus Quantitative Research

The essence of qualitative research is to make sense of and recognize patterns among words in order to build up a meaningful picture without compromising its richness and dimensionality. Like quantitative research, the qualitative research aims to seek answers for questions of “how, where, when who and why” with a perspective to build a theory or refute an existing theory. Unlike quantitative research which deals primarily with numerical data and their statistical interpretations under a reductionist, logical and strictly objective paradigm, qualitative research handles nonnumerical information and their phenomenological interpretation, which inextricably tie in with human senses and subjectivity. While human emotions and perspectives from both subjects and researchers are considered undesirable biases confounding results in quantitative research, the same elements are considered essential and inevitable, if not treasurable, in qualitative research as they invariable add extra dimensions and colors to enrich the corpus of findings. However, the issue of subjectivity and contextual ramifications has fueled incessant controversies regarding yardsticks for quality and trustworthiness of qualitative research results for healthcare.

Impact of Qualitative Research upon Primary Care

In many ways, qualitative research contributes significantly, if not more so than quantitative research, to the field of primary care at various levels. Five qualitative studies are chosen to illustrate how various methodologies of qualitative research helped in advancing primary healthcare, from novel monitoring of chronic obstructive pulmonary disease (COPD) via mobile-health technology,[ 1 ] informed decision for colorectal cancer screening,[ 2 ] triaging out-of-hours GP services,[ 3 ] evaluating care pathways for community psychiatry[ 4 ] and finally prioritization of healthcare initiatives for legislation purposes at national levels.[ 5 ] With the recent advances of information technology and mobile connecting device, self-monitoring and management of chronic diseases via tele-health technology may seem beneficial to both the patient and healthcare provider. Recruiting COPD patients who were given tele-health devices that monitored lung functions, Williams et al. [ 1 ] conducted phone interviews and analyzed their transcripts via a grounded theory approach, identified themes which enabled them to conclude that such mobile-health setup and application helped to engage patients with better adherence to treatment and overall improvement in mood. Such positive findings were in contrast to previous studies, which opined that elderly patients were often challenged by operating computer tablets,[ 6 ] or, conversing with the tele-health software.[ 7 ] To explore the content of recommendations for colorectal cancer screening given out by family physicians, Wackerbarth, et al. [ 2 ] conducted semi-structure interviews with subsequent content analysis and found that most physicians delivered information to enrich patient knowledge with little regard to patients’ true understanding, ideas, and preferences in the matter. These findings suggested room for improvement for family physicians to better engage their patients in recommending preventative care. Faced with various models of out-of-hours triage services for GP consultations, Egbunike et al. [ 3 ] conducted thematic analysis on semi-structured telephone interviews with patients and doctors in various urban, rural and mixed settings. They found that the efficiency of triage services remained a prime concern from both users and providers, among issues of access to doctors and unfulfilled/mismatched expectations from users, which could arouse dissatisfaction and legal implications. In UK, a care pathways model for community psychiatry had been introduced but its benefits were unclear. Khandaker et al. [ 4 ] hence conducted a qualitative study using semi-structure interviews with medical staff and other stakeholders; adopting a grounded-theory approach, major themes emerged which included improved equality of access, more focused logistics, increased work throughput and better accountability for community psychiatry provided under the care pathway model. Finally, at the US national level, Mangione-Smith et al. [ 5 ] employed a modified Delphi method to gather consensus from a panel of nominators which were recognized experts and stakeholders in their disciplines, and identified a core set of quality measures for children's healthcare under the Medicaid and Children's Health Insurance Program. These core measures were made transparent for public opinion and later passed on for full legislation, hence illustrating the impact of qualitative research upon social welfare and policy improvement.

Overall Criteria for Quality in Qualitative Research

Given the diverse genera and forms of qualitative research, there is no consensus for assessing any piece of qualitative research work. Various approaches have been suggested, the two leading schools of thoughts being the school of Dixon-Woods et al. [ 8 ] which emphasizes on methodology, and that of Lincoln et al. [ 9 ] which stresses the rigor of interpretation of results. By identifying commonalities of qualitative research, Dixon-Woods produced a checklist of questions for assessing clarity and appropriateness of the research question; the description and appropriateness for sampling, data collection and data analysis; levels of support and evidence for claims; coherence between data, interpretation and conclusions, and finally level of contribution of the paper. These criteria foster the 10 questions for the Critical Appraisal Skills Program checklist for qualitative studies.[ 10 ] However, these methodology-weighted criteria may not do justice to qualitative studies that differ in epistemological and philosophical paradigms,[ 11 12 ] one classic example will be positivistic versus interpretivistic.[ 13 ] Equally, without a robust methodological layout, rigorous interpretation of results advocated by Lincoln et al. [ 9 ] will not be good either. Meyrick[ 14 ] argued from a different angle and proposed fulfillment of the dual core criteria of “transparency” and “systematicity” for good quality qualitative research. In brief, every step of the research logistics (from theory formation, design of study, sampling, data acquisition and analysis to results and conclusions) has to be validated if it is transparent or systematic enough. In this manner, both the research process and results can be assured of high rigor and robustness.[ 14 ] Finally, Kitto et al. [ 15 ] epitomized six criteria for assessing overall quality of qualitative research: (i) Clarification and justification, (ii) procedural rigor, (iii) sample representativeness, (iv) interpretative rigor, (v) reflexive and evaluative rigor and (vi) transferability/generalizability, which also double as evaluative landmarks for manuscript review to the Medical Journal of Australia. Same for quantitative research, quality for qualitative research can be assessed in terms of validity, reliability, and generalizability.

Validity in qualitative research means “appropriateness” of the tools, processes, and data. Whether the research question is valid for the desired outcome, the choice of methodology is appropriate for answering the research question, the design is valid for the methodology, the sampling and data analysis is appropriate, and finally the results and conclusions are valid for the sample and context. In assessing validity of qualitative research, the challenge can start from the ontology and epistemology of the issue being studied, e.g. the concept of “individual” is seen differently between humanistic and positive psychologists due to differing philosophical perspectives:[ 16 ] Where humanistic psychologists believe “individual” is a product of existential awareness and social interaction, positive psychologists think the “individual” exists side-by-side with formation of any human being. Set off in different pathways, qualitative research regarding the individual's wellbeing will be concluded with varying validity. Choice of methodology must enable detection of findings/phenomena in the appropriate context for it to be valid, with due regard to culturally and contextually variable. For sampling, procedures and methods must be appropriate for the research paradigm and be distinctive between systematic,[ 17 ] purposeful[ 18 ] or theoretical (adaptive) sampling[ 19 20 ] where the systematic sampling has no a priori theory, purposeful sampling often has a certain aim or framework and theoretical sampling is molded by the ongoing process of data collection and theory in evolution. For data extraction and analysis, several methods were adopted to enhance validity, including 1 st tier triangulation (of researchers) and 2 nd tier triangulation (of resources and theories),[ 17 21 ] well-documented audit trail of materials and processes,[ 22 23 24 ] multidimensional analysis as concept- or case-orientated[ 25 26 ] and respondent verification.[ 21 27 ]

Reliability

In quantitative research, reliability refers to exact replicability of the processes and the results. In qualitative research with diverse paradigms, such definition of reliability is challenging and epistemologically counter-intuitive. Hence, the essence of reliability for qualitative research lies with consistency.[ 24 28 ] A margin of variability for results is tolerated in qualitative research provided the methodology and epistemological logistics consistently yield data that are ontologically similar but may differ in richness and ambience within similar dimensions. Silverman[ 29 ] proposed five approaches in enhancing the reliability of process and results: Refutational analysis, constant data comparison, comprehensive data use, inclusive of the deviant case and use of tables. As data were extracted from the original sources, researchers must verify their accuracy in terms of form and context with constant comparison,[ 27 ] either alone or with peers (a form of triangulation).[ 30 ] The scope and analysis of data included should be as comprehensive and inclusive with reference to quantitative aspects if possible.[ 30 ] Adopting the Popperian dictum of falsifiability as essence of truth and science, attempted to refute the qualitative data and analytes should be performed to assess reliability.[ 31 ]

Generalizability

Most qualitative research studies, if not all, are meant to study a specific issue or phenomenon in a certain population or ethnic group, of a focused locality in a particular context, hence generalizability of qualitative research findings is usually not an expected attribute. However, with rising trend of knowledge synthesis from qualitative research via meta-synthesis, meta-narrative or meta-ethnography, evaluation of generalizability becomes pertinent. A pragmatic approach to assessing generalizability for qualitative studies is to adopt same criteria for validity: That is, use of systematic sampling, triangulation and constant comparison, proper audit and documentation, and multi-dimensional theory.[ 17 ] However, some researchers espouse the approach of analytical generalization[ 32 ] where one judges the extent to which the findings in one study can be generalized to another under similar theoretical, and the proximal similarity model, where generalizability of one study to another is judged by similarities between the time, place, people and other social contexts.[ 33 ] Thus said, Zimmer[ 34 ] questioned the suitability of meta-synthesis in view of the basic tenets of grounded theory,[ 35 ] phenomenology[ 36 ] and ethnography.[ 37 ] He concluded that any valid meta-synthesis must retain the other two goals of theory development and higher-level abstraction while in search of generalizability, and must be executed as a third level interpretation using Gadamer's concepts of the hermeneutic circle,[ 38 39 ] dialogic process[ 38 ] and fusion of horizons.[ 39 ] Finally, Toye et al. [ 40 ] reported the practicality of using “conceptual clarity” and “interpretative rigor” as intuitive criteria for assessing quality in meta-ethnography, which somehow echoed Rolfe's controversial aesthetic theory of research reports.[ 41 ]

Food for Thought

Despite various measures to enhance or ensure quality of qualitative studies, some researchers opined from a purist ontological and epistemological angle that qualitative research is not a unified, but ipso facto diverse field,[ 8 ] hence any attempt to synthesize or appraise different studies under one system is impossible and conceptually wrong. Barbour argued from a philosophical angle that these special measures or “technical fixes” (like purposive sampling, multiple-coding, triangulation, and respondent validation) can never confer the rigor as conceived.[ 11 ] In extremis, Rolfe et al. opined from the field of nursing research, that any set of formal criteria used to judge the quality of qualitative research are futile and without validity, and suggested that any qualitative report should be judged by the form it is written (aesthetic) and not by the contents (epistemic).[ 41 ] Rolfe's novel view is rebutted by Porter,[ 42 ] who argued via logical premises that two of Rolfe's fundamental statements were flawed: (i) “The content of research report is determined by their forms” may not be a fact, and (ii) that research appraisal being “subject to individual judgment based on insight and experience” will mean those without sufficient experience of performing research will be unable to judge adequately – hence an elitist's principle. From a realism standpoint, Porter then proposes multiple and open approaches for validity in qualitative research that incorporate parallel perspectives[ 43 44 ] and diversification of meanings.[ 44 ] Any work of qualitative research, when read by the readers, is always a two-way interactive process, such that validity and quality has to be judged by the receiving end too and not by the researcher end alone.

In summary, the three gold criteria of validity, reliability and generalizability apply in principle to assess quality for both quantitative and qualitative research, what differs will be the nature and type of processes that ontologically and epistemologically distinguish between the two.

  • Cited Here |
  • Google Scholar

Source of Support: Nil.

Conflict of Interest: None declared.

Controversies; generalizability; primary care research; qualitative research; reliability; validity

  • + Favorites
  • View in Gallery

Readers Of this Article Also Read

“the burden of lifestyle diseases and their impact on health service in..., the measurement of the quality of educational services provided to medical..., determinants of socio-demographic and environmental barriers in prevention of..., financing primary healthcare for rural areas, risk factors and outcome among covid-19 exposed and quarantined healthcare....

  • Open access
  • Published: 18 June 2024

Measurement properties of the Iranian version of the breast cancer perception scale (BCPS) according to the COSMIN checklist

  • Sepideh Mashayekh-Amiri 1 ,
  • Mohammad Asghari Jafarabadi 2 , 3 , 4 ,
  • Mina Hosseinzadeh 5 ,
  • Elham seyed Kanani 6 &
  • Mojgan Mirghafourvand 7 , 8  

BMC Cancer volume  24 , Article number:  743 ( 2024 ) Cite this article

55 Accesses

Metrics details

Breast cancer is a prevalent cancer characterized by its aggressive nature and potential to cause mortality among women. The rising mortality rates and women’s inadequate perception of the disease’s severity in developing countries highlight the importance of screening using conventional methods and reliable scales. Since the validity and reliability of the breast cancer perception scale (BCPS) have not been established in the Iranian context. Therefore, this study aimed to determine the measurement properties of the BCPS in women residing in Tabriz, Iran.

The present study comprised a cross-sectional design, encompassing a sample of 372 Iranian women. The participants were selected through a multi-stage cluster random sampling technique conducted over a period spanning from November 2022 to February 2023. The measurement properties of the Iranian version of BCPS were assessed following the guidelines outlined in the COSMIN checklist. This involved conducting various steps, including the translation process, reliability testing (internal consistency, test-retest reliability, and measurement error), and methodological tests for validity (content validity, face validity, construct validity, and hypothesis testing). The study also investigated the factors of responsiveness and interpretability. The presence of floor and ceiling effects was assessed.

The internal consistency of the scale was assessed using Cronbach’s alpha, yielding a satisfactory value of 0.68. Additionally, McDonald’s omega (95% CI) was computed, resulting in a value of 0.70 (0.66 to 0.74). Furthermore, the test-retest reliability was evaluated, revealing a high intraclass correlation coefficient (ICC) of 0.97 (95% CI: 0.94 to 0.99). The CVI, CVR, and impact scores of the BCPS were determined to be 0.98, 0.95, and 3.70, respectively, indicating favorable levels of content and face validity. To assess construct validity, an examination of the Exploratory Factor Analysis (EFA) was conducted on a set of 24 items. This analysis revealed the presence of six distinct factors, which collectively accounted for 52% of the cumulative variance. The fit indices of the validity model (CFI = 0.91, NFI = 0.96, RFI = 0.94, TLI = 0.90, χ 2 /df = 2.03, RMSEA = 0.055 and SRMR = 0.055) were confirmed during the confirmatory factor analysis (CFA). The overall score of BCPS exhibited a ceiling effect of 0.3%. The floor effect observed in the overall score (BCPS) was found to be 0.5%. Concerning the validation of the hypothesis, Spearman’s correlation coefficient of 0.55 was obtained between the BCPS and the QLICP-BR V2.0. This correlation value signifies a statistically significant association. Furthermore, it is worth noting that the minimum important change (MIC) of 3.92 exhibited a higher value compared to the smallest detectable change (SDC) of 3.70, thus suggesting a satisfactory level of response.

Conclusions

The obtained findings suggest that the Iranian version of the BCPS demonstrates satisfactory psychometric properties for assessing the perception of breast cancer among Iranian women. Furthermore, it exhibits favorable responsiveness to clinical variations. Consequently, it can serve as a screening instrument for healthcare professionals to comprehend breast cancer and as a reliable tool in research endeavors.

Peer Review reports

Breast cancer is a significant global health issue [ 1 ], accounting for approximately 30% of cancer cases among women [ 2 ]. It is recognized as the second-leading cause of mortality in developed nations and the third-leading cause of mortality in less developed nations [ 3 ]. Based on the findings of the Global Cancer Incidence, Mortality and Prevalence (GLOBOCAN) report in 2020, it was determined that there were an estimated 2,261,419 million new cases (1 in 4 new cancer cases) (11.7%), and 684,996 (1 in 6 deaths) (6.9%) fatalities [ 4 ]. It is also predicted that these figures will reach 2,964,197 in 2040 (31% rise from 2018) [ 5 ], and 4.4 million in 2070 (110% rise from 2018) [ 6 ]. According to the report of the American Cancer Society in 2024, the number of new cases of female breast cancer in the United States was 310,720 and the number of deaths was 42,250 [ 7 ]. Approximately two-thirds of these fatalities are documented in regions with lower levels of development [ 4 ]. Alternatively, based on the projection provided by the World Health Organization (WHO), it is anticipated that by 2050, approximately 2.3 million women will receive a diagnosis of breast cancer [ 8 ]. Breast cancer is recognized as a highly costly form of cancer on a global scale, with an estimated annual expenditure of approximately 88 billion dollars. Failing to promptly diagnose and conduct screening examinations, coupled with the consequential impact on the entire family unit, imposes substantial financial burdens on society [ 9 ].

Breast cancer exhibits the highest prevalence and mortality rates among women in the Eastern Mediterranean region (EMR), encompassing Iran when compared to other forms of cancer. Breast cancer is widely recognized as the predominant form of cancer in Iran, ranking as the fifth-highest cause of mortality among women in the country [ 10 ]. The Age-standardized rate (ASR) incidence rate is approximately 28 per 100,000 individuals, exhibiting a recent upward trend [ 11 ]. Based on a systematic review, it has been documented that the incidence rate of breast cancer in Iran stands at 23.6% [ 12 ]. The reported prevalence of this cancer in the United States (US) is approximately 13%, indicating that one out of every eight individuals is affected [ 13 ].

Breast cancer is correlated with numerous risk factors, a significant proportion of which remain unidentified. The findings of a systematic review conducted in 2020 in Iran reveal various risk factors associated with breast cancer. These factors include family history, hormone replacement therapy (HRT), exposure to passive smoking, advanced maternal age during pregnancy, history of abortion, consumption of sweets, and possession of the Arg/Arg genotype. These factors have been found to potentially elevate the risk of developing breast cancer. Conversely, certain factors such as the late onset of menstruation, nulliparity, breastfeeding for a duration of 13 to 24 months, regular physical exercise, and consumption of vegetables have been observed to have a protective effect against the incidence of breast cancer [ 14 ].

It is noteworthy that the incidence of breast cancer among Iranian women occurs at an age approximately 10 years earlier compared to women in other developed nations. According to various studies, there has been a documented increase in the prevalence of breast cancer among women under the age of 40 in recent years [ 15 ]. The rise in its occurrence in developing nations is primarily attributed to alterations in lifestyle and reproductive behaviors [ 16 ]. The majority of female individuals afflicted with breast cancer receive a diagnosis during the later stages of the ailment, thereby correlating with an elevated mortality rate. Hence, it has been observed that early detection of breast cancer leads to a significant improvement in both survival rates and treatment outcomes, with a reported increase of 90% [ 17 ].

The United States Office of Disease Prevention and Health Promotion’s Healthy People 2020 initiative has as its goals the improvement of breast cancer diagnostic procedures for women, a decrease in the prevalence of cases of end-stage cancer, and a reduction in breast cancer mortality rates. Conversely, in the case of cancers that exhibit both genetic and environmental risk factors, it is imperative to adopt strategies that prioritize modifiable risk factors and early detection. Hence, the implementation of a preventive strategy aimed at early detection, which incorporates the evaluation of knowledge on breast cancer and its associated risks, assumes paramount significance [ 18 ].

The perception of breast cancer is a crucial subjective psychological phenomenon that is associated with the evaluation of potential threats. This evaluation is linked to an individual’s assessment of their susceptibility to the disease and the probability of gaining advantages from engaging in preventive measures [ 15 ]. Various studies have indicated that risk perception is a significant determinant of preventive health-related behaviors, such as screening. The motivation to undergo screening tests can be influenced by individuals’ perceptions of the risk associated with breast cancer [ 16 ]. According to the literature, screening tests play a crucial role in mitigating complications and mortality associated with breast cancer [ 17 ].

Various studies have documented divergent findings regarding the correlation between the perception of breast cancer risk and the utilization of screening tests, such as mammography [ 18 ]. Research findings indicate that the implementation of mammography screening during the age range of 40 to 49 years has been associated with a reduction in mortality rates of approximately 15 to 20% [ 19 ].

According to a study conducted on families to assess their perception of breast cancer risk, the rate of adherence to screening tests in Germany was found to be 83% [ 20 ]. Conversely, a research study conducted in Iran examined the adherence rate of women aged 35 to 69 years to mammography, as recommended by screening programs. The findings revealed that in urban areas, the adherence rate was 8.3%, while in rural areas, it was 3.16% [ 1 ]. Therefore, the perception level that women possess regarding breast cancer has the potential to influence their subsequent actions, such as seeking medical evaluation and undergoing screening procedures like breast self-examination (BSE), clinical breast examination (CBE), and mammography. Therefore, it is imperative to assess the perception of breast cancer in women using a multidimensional approach [ 20 ].

Numerous methodologies have been suggested for assessing the perception of breast cancer risk, which can be categorized into two distinct types: evaluation of the objective perception of risk (i.e., actual risk) and evaluation of the subjective perception of risk [ 21 ]. Currently, Gill’s model predominantly serves as a tool for conducting quantitative risk assessments. This approach aims to objectively evaluate the actual risk by considering the attributes of risk factors [ 22 ]. The second method entails the assessment of self-perceived risk, which can be anticipated by gauging individuals’ mental perceptions using a visual analog scale (VAS). Despite the presence of a multitude of tools within this domain, their practicality appears to be limited as they do not provide comprehensive coverage of all the factors that influence behaviors related to the diagnosis of breast cancer [ 23 ].

Taylan et al. (2021) developed the BCPS in Turkey, considering the health belief model for the first time. This scale encompasses various domains, including Perceived knowledge, Perceived treatment belief, Perceived need for health check, Perceived stigma, Perceived fear and Perceived risk. The utilization of this scale offers several benefits in assessing women’s perceptions regarding the factors influencing breast cancer diagnostic behavior comprehensively. Furthermore, it is worth noting that the extent of women’s perceived knowledge of breast cancer has not been quantitatively evaluated thus far. Consequently, this tool serves as a distinctive scale specifically designed to measure women’s knowledge regarding cancer. Additionally, it quantifies the dimensions of the breast [ 23 ].

The Health Belief Model (HBM) was initially formulated by Becker et al., in 1974 to comprehend health-related protective behaviors [ 24 ]. The evaluation of perceived risk, employing the HBM, has been validated in various studies examining screening behaviors, such as those related to breast cancer diagnostics [ 25 , 26 , 27 ]. The model encompasses various dimensions, namely perceived sensitivity, perceived severity, perceived benefits, perceived barriers, self-efficacy, and guidance for action. Based on the presented model, individuals’ healthcare behaviors can be subject to influence from factors such as perception, beliefs, values, and attitudes. By identifying and modifying an individual’s perceptions, beliefs, and attitudes, the effectiveness of healthcare education or treatment can be enhanced [ 26 ].

However, it is crucial to assess the methodological rigor of studies that evaluate the measurement properties of instruments used to measure health-related patient-reported outcomes (HR-PROs) [ 28 ]. The Consensus-Based Standards for the Selection of Health Status Measurement Instruments (COSMIN) checklist was developed by Mokkink et al. in 2010 through a consensus-based approach utilizing the Delphi method [ 28 ]. The COSMIN list is widely regarded as a highly comprehensive set of criteria for selecting an appropriate tool. It serves as a valuable guide for researchers, offering a range of logical indicators that aid in the process of tool selection [ 29 ].

Given the rising incidence of breast cancer, the significance of early screening, and the potential influence of risk perception on women’s adoption of preventive behaviors, such as breast screening methods, it is pertinent to assess the level of knowledge regarding breast cancer when devising interventions aimed at modifying health behaviors. It is worth noting that the validity and reliability of the aforementioned assessment tool have not been established in Iran. The present study was undertaken to conduct a measurement propertice of the BCPS in women residing in Tabriz city-Iran, by using the COSMIN checklist.

The present study was conducted with the aim of determining the measurement properties of the breast cancer risk perception scale (BCPS) in according to COSMIN checklist in women in Tabriz, Iran.

Validity procedure

The measurement properties of the Iranian version of BCPS were assessed following the guidelines outlined in the COSMIN checklist [ 29 ]. This involved conducting various steps, including the translation process, reliability testing (internal consistency, test-retest reliability, and measurement error), and methodological tests for validity (content validity, face validity, construct validity, and hypothesis testing). The study also investigated the factors of responsiveness and interpretability. The presence of floor and ceiling effects was assessed.

Translation process

Initially, permission to use the BCPS was obtained by sending an email from the original designer of the instrument (Taylan et al.) [ 23 ]. Efforts were made to maintain the integrity of the original intent during the translation process. Following the recommendations made by the WHO, EORTC Quality of Life Group Translation Procedure Guidelines and expert panel review, this was performed [ 30 ]. The translation process involves the utilization of two distinct methods. The two methods utilized in this study are the Forward-Backward method (FB) and the Dual Panel method (DP), which were implemented throughout four distinct stages. The process consisted of four stages: forward translation, backward translation, pre-testing and cognitive interviewing, and the final version.

During the initial phase of translation, the original English version of the instrument was administered to two individuals who were native Persian speakers, proficient in English, and possessed expertise in the development of the instrument as well as knowledge in the field of breast cancer. Translators were subsequently instructed to translate the tool in a fully autonomous and individual manner, with a focus on conceptual rather than literal translations. Additionally, they were encouraged to use language that would be comprehensible to the majority of the target audience. Ultimately, two translators looked into the discrepancies between the two translated versions, which led to a reconciled translation. Subsequently, the identified issues were addressed, leading to the presentation of a unified version [ 31 ]. Subsequently, the backward translation method was employed to guarantee a comprehensive correspondence between the Persian translation and the original version. The translated questionnaire from the preceding stage was administered to two individuals who are native English speakers. These individuals were not involved in the forward translation process and had no prior exposure to the original version of the questionnaire. They were instructed to retranslate the questionnaire back into English. The concluding report at the culmination of this phase encompassed the following components: two forward translations from the English language to Farsi, a reconciled translation, two backward translations from Farsi to English, and the incorporation of any supplementary remarks regarding the translations provided by the panel of experts. Ultimately, before implementing the tool in the intended population, it is imperative to conduct a pilot study. To achieve the intended objective, a questionnaire was administered to a sample of ten qualified female participants. Based on the feedback received from these participants regarding the ease of completing the instrument, grammar, comprehensibility, and writing style, modifications were made to the Persian version, and the revised version was finalized and presented [ 31 ].

Validation study

This cross-sectional study aimed to assess the measurement properties of the Iranian version of the BCPS among a sample of 372 Iranian women who sought healthcare services at Tabriz health centers affiliated with Tabriz University of Medical Sciences in two separate secondary samples (172 participants via exploratory factor analysis and 200 participants via confirmatory factor analysis). The study was conducted following the approval of the Ethics Committee of Tabriz University of Medical Sciences (ref: IR.TBZMED.REC.1401.390) from November 2022 to February 2023. It is important to acknowledge that informed consent was obtained from all participants. The study was conducted in adherence to the applicable regulations of the Ethics Committee of the University of Medical Sciences and the Declaration of Helsinki.

Among the 410 women, it was found that 16 participants did not satisfy the predetermined eligibility criteria, leading to their exclusion from the research investigation. Out of the remaining 394 participants who met the eligibility criteria, a total of 22 participants expressed their unwillingness to participate in the study. Ultimately, a total of 372 participants were incorporated into the research investigation. The response rate for the study was 94%, (372/394). In the sampling procedure, the cluster method was employed to randomly select a quarter of the 92 health centers in Tabriz City. The selection process was facilitated through the utilization of the website ( www.random.org ). In addition to their contact information, which was obtained from the SIB system (integrated health system), women were selected at random from the compiled list. To clarify, the selection of women from each center was determined based on the proportional sampling method, and the process of randomly selecting women was carried out utilizing the aforementioned website. Following a telephone conversation with the participants, wherein the researcher offered a concise overview of the research, the researcher proceeded to invite the women to attend the designated health center within the specified timeframe. The purpose of this visit was to provide additional explanations and administer the questionnaires. Upon conducting a visit and assessing the fundamental aspects such as basic information and inclusion and exclusion criteria, the individual proceeded to furnish the concerned parties with extensive details related to the research, its advantages, outcomes, and the confidentiality of the data. It is important to acknowledge that the random selection of participants was conducted before the assessment of their eligibility criteria. Following their visit to the health center, participants underwent a comprehensive evaluation to gather baseline data and determine their eligibility. Full information regarding the research objectives and methodology was exclusively provided to individuals who satisfied the predetermined eligibility criteria, and only they were extended an invitation to participatein the study. After agreeing to participate in the study, the participants proceeded to complete the informed consent form, the questionnaire of socio-demographic characteristics, and the BCPS.

The study’s inclusion criteria encompassed women who were at least 20 years old, exhibited no indications of abnormal breast lesions during clinical examination, and possessed the necessary literacy skills to complete the questionnaire. The study excluded individuals who met the following criteria: a confirmed diagnosis of breast cancer as documented in medical records, a history of cosmetic breast surgery, and impairment in communication skills related to hearing and speaking, and an inability to physically, cognitively, or mentally respond to questions.

Socio-demographic questionnaire

The questionnaire included questions regarding socio-demographic factors such as age, spouse age, marital status, educational level, job, income, breast cancer history, history of hormone therapy, family history of breast cancer and menopause status.

Breast cancer perception scale

Taylan et al. (2021) in Turkey [ 23 ] developed the BCPS. The present tool is grounded in the theoretical framework of the HBM and comprises a set of 24 items designed to assess women’s perceptions of breast cancer. The construct comprises six sub-dimensions, including Perceived knowledge, Perceived treatment belief, Perceived need for health check, Perceived stigma, Perceived fear and Perceived risk, which are assessed using a five-point Likert scale. The responses span a spectrum from strongly disagree (1) to strongly agree (5). The scale’s validity and reliability have been empirically established within the specific demographic of Turkish women in 2021. The lower bound of the scoring range for this questionnaire is 24, while the upper bound is set at 120. A positive correlation exists between higher scores and a greater level of women’s perception of breast cancer [ 23 ].

Sample size determination

It is imperative to ascertain the appropriate sample size to conduct the factor analysis procedure. According to a rule of thumb, the classification of sample size for EFA is as follows: a sample size of 50 is considered very poor, 100 is poor, 200 is fair, 300 is good, 500 is very good, and 1000 is excellent [ 32 ]. To ensure the reliability and validity of the results, it is necessary to have an appropriate sample size when conducting factor analysis. This study incorporated the guidelines proposed by Nunnally [ 33 ], which recommend a sample size of 5 to 10 samples for each instrument question to facilitate the generalizability of the findings to the broader community. Under these guidelines, it was deemed appropriate to utilize a sample size of 10 samples for each case, taking into account that the tool consisted of 24 items. Therefore, initially, a minimum of 240 samples were considered necessary. Nevertheless, it is imperative to take into account the effectof the cluster sampling technique employed in the research. The cluster sampling method introduces a factor of intra-cluster correlation that necessitates its inclusion in the calculation of the sample size [ 33 ]. To address this issue, a design effect of 1.5 was employed to modify the sample size. Due to a 10% attrition rate, the sample size has consequently expanded to encompass a total of 372 participants.

Statistical analysis

The statistical analysis was conducted using the SPSS software package (version 16, IBM Corp., Armonk, NY, USA), STATA14 (Statcorp, College Station, Texas, USA), and R software 4.2 (Psych package). To examine socio-demographic data, descriptive statistics were employed, including frequency (percentage) for qualitative variables, minimum and maximum values, and mean ± standard deviation (SD) for quantitative variables. The evaluation encompassed methodological testing, which involved the assessment of reliability, validity, and responsiveness. Exploratory factor analysis (EFA) and confirmatory factor analysis (CFA) techniques were employed to assess construct validity on a larger scale. The direct oblimin method was employed in the EFA. Bartlett’s test for sphericity and KMO’s test for assessing the adequacy of scale content and sample size were conducted. The CFA methodology was employed to assess the factor structure and factor loadings of the scale. In conclusion, an assessment was conducted to evaluate the reliability of the study, specifically focusing on internal consistency, test-retest reliability, and measurement error. Finally, the presence of ceiling and floor effects was assessed.

Methodological testing according to the COSMIN checklist

  • Reliability

Reliability refers to the extent to which a measurement is devoid of any errors that may arise during the measurement process. The evaluation of reliability primarily involves the assessment of three key characteristics: internal consistency, test-retest reliability, and measurement error [ 29 ].

Internal consistency

Internal consistency refers to the extent of interconnectedness among items. It serves as an estimation of the correlation level between the variables that constitute the intended structure or instrument [ 29 ]. The internal consistency of the instrument as a whole and its six subscales was assessed using both Cronbach’s alpha coefficient and McDonald’s omega coefficient. A minimum threshold of 0.7 was deemed necessary for both Cronbach’s alpha and MacDonald’s Omega coefficients to establish satisfactory internal consistency [ 34 ].

Test–retest reliability

Test-retest reliability refers to the extent to which the outcomes of a patient with identical health conditions remain consistent over a while [ 29 ]. Following the guidelines outlined in the COSMIN manual, a test-retest procedure was conducted with a minimum interval of two weeks. This time frame was chosen to prevent participants from recalling their previous responses and to account for any potential changes in their health status [ 29 ]. To achieve the intended objective, a survey was administered to a cohort of 30 female participants on two separate occasions, with a 14-day gap between each administration. The resulting scores were subsequently utilized to assess the reliability of the survey instrument through the application of the intraclass correlation coeficient (ICC). A reliability coefficient greater than 0.7 was deemed advantageous [ 34 ].

Measurement error

Measurement error is considered one of the key indicators of measurement and test reliability. In essence, it refers to the presence of both systematic and random errors in the patient’s score, which cannot be attributed to genuine variations in the construct under consideration. The calculation of the standard error of measurement (SEM) involves the use of the formula (SEM = SD√1-ICC), where SD represents the standard deviation [ 34 ]. The concept of the smallest detectable change (SDC) pertains to the minimum magnitude of an individual score change that can be accurately interpreted as a genuine change. The calculation of the SDC is determined by employing the formula (SDC = SEM*1.96*√2). A reduced level of the SDC corresponds to an increased level of measurement sensitivity [ 34 ].

Validity refers to the extent to which a given instrument accurately measures the specific characteristic it is designed to assess [ 29 ].

Face validity

Face validity is a concept that pertains to the extent to which the items within an instrument, specifically the HR-PRO, accurately represent the underlying construct that is intended to be measured [ 29 ]. The researchers conducted an assessment of face validity using both qualitative and quantitative methods. To conduct a qualitative assessment of face validity, a sample of 10 women from health centers in Tabriz City was selected using a convenience sampling method. This sample then examined the initial questionnaire. The participants assessed the quality, level of difficulty, lack of relevance, and degree of ambiguity of the items. To evaluate the face validity, item impact scores were quantitatively computed. During this phase, the aforementioned participants assessed each item on a 5-point Likert scale, ranging from “completely important” to “not at all important,” with scores ranging from 5 to 1 (representing “completely important,” “important,” “moderately important,” “slightly important,” and “not important,” respectively). The impact score is calculated by multiplying the Frequency (expressed as a percentage) by the Importance (Impact Score = Frequency (%) × Importance). Items with an impact score exceeding 1.5 were deemed appropriate and were subsequently retained for further stages of analysis [ 35 ].

Content validity

The degree to which the content of an HR-PRO instrument effectively represents the construct that is intended to be assessed [ 29 ]. Both qualitative and quantitative methods were used to examine the validity of the content of the questionnaire. To assess the credibility of the qualitative content, a group of ten experts, including three experts in reproductive health, two specialists in midwifery, three specialists in medical-surgical nursing, and two specialists in community health nursing, were invited to provide their insights and opinions on topics considering grammar, vocabulary choice, item arrangement, and scoring.

The inclusion criteria of the experts to determine content validity include voluntary participation, faculty members with the rank of assistant professor and above, PhDs of midwifery/nursing and individuals with clinical experience in breast cancer. The process of assessing quantitative content validity involves the calculation of two measures: the content validity ratio (CVR) and the content validity index (CVI) [ 36 ]. To fulfill the objective, a questionnaire comprising questions organized into two overarching categories was distributed to each expert. In the initial phase, the participants assessed the items using a 3-point Likert scale (necessary, useful but not necessary, not necessary) to ascertain the CVR, which was computed using the following mathematical expression:

CVR= (Ne-N/2)/ (N/2).

Where, “Ne” represents the count of experts who have chosen the “necessary” option, and N denotes the total number of experts. Regarding Lawshe table, a CVR > 0.62 for a sample size of 10 individuals, confirms the essentiality of the items under investigation [ 37 ].

Subsequently, the CVI review underwent evaluation by an identical group of 10 experts. Concerning this matter, questions have been raised regarding the three criteria of relevance, clarity, and simplicity for each item. These criteria have been assessed using a four-point Likert scale, which includes options such as irrelevant, somewhat relevant, relevant, and completely relevant. The assessment is based on the content validity index [ 38 ] developed by Waltz and Basel. The level of relevance, clarity, and simplicity was assessed by experts based on their subjective evaluation, and then the CVI was computed using the following formula:

CVI = number of experts giving a rating of 3 and 4 / total number of experts.

CVIs higher than 0.79, between 0.70 and 0.79, and less than 0.70 were considered acceptable, in need of correction, and unacceptable, respectively [ 39 ].

Construct validity

The concept of construct validity pertains to the extent to which the scores obtained from an HR-PRO instrument align with the anticipated hypotheses. This alignment can be observed in terms of internal relationships, relationships with scores obtained from other instruments, or differences between relevant groups. This assessment is contingent upon the assumption that the HR-PRO instrument possesses validity. The concept of validity pertains to the extent to which a given measure accurately assesses the construct it is intended to measure. The three aspects encompassed in this study are as follows: structural validity, which pertains to the internal relationships within the construct; hypothesis testing; and cross-cultural validity, which focuses on the relationships with scores on other instruments or differences between relevant groups [ 29 ].

Structural validity

The suitability of the data for EFA was assessed by employing the Kaiser-Meyer Olkin (KMO) criterion and Bartlett’s test of sphericity. The KMO test is a statistical measure that quantifies the proportion of variance in the questions that can be attributed to the primary factors. Typically, values falling within the range of 0.8–1 are indicative of adequate data sampling to conduct factor analysis. However, when the value of the statistic falls below 0.7, it indicates that the sample size is insufficient, necessitating the implementation of corrective actions [ 40 ].

Bartlett’s test of sphericity is a frequently employed statistical test to assess the appropriateness of data for factor analysis. The significance of this test serves as an indicator of the suitability of the data for factor analysis [ 40 ]. The process of extracting factors from the 24 items of the questionnaire was conducted using the principal component analysis method, employing varimax rotation (direct oblimin). The determination of the number of factors was based on the criterion of an Eigenvalue greater than 1 and the examination of the Scree plot. In this analysis, a minimum factor loading threshold of 0.3 was utilized for the extraction of factors. In contrast, CFA employs the maximum likelihood method to estimate the model’s fit indices, and a range of indices are utilized to assess the appropriateness of the model. This study assessed the adequacy of the model by employing the indicators outlined below [ 41 ]:

Root mean score error of approximation (RMSEA < 0.08), standardized root mean square residual (SRMR < 0.10), normed Chi 2 (x 2 / df) < 5, comparative fit indices including comparative fit index (CFI > 0.90), Bentler-Bonett Normed Fit Index (NFI) > 0.90, Relative fit index (RFI) > 0.90 and Tucker-Lewis Index (TLI) > 0.90.

Hypothesis testing

The process of hypothesis testing is characterized by its continuous and iterative nature. Hypotheses serve as a means to express the anticipated direction and magnitude of correlations or differences between the construct under investigation and other constructs. As the number of hypotheses tested regarding the alignment between the data and pre-existing hypotheses increases, a greater amount of evidence supporting construct validity is accumulated [ 29 ]. To assess construct validity, an analysis of the hypotheses that were previously formulated was conducted. In this study, it was postulated that the BCPS would exhibit a strong correlation with other subjective scales, such as quality-of-life instruments for cancer patients (QLICP-BR V2.0). Hence, confirmation of the desired hypothesis can be achieved when the Pearson correlation coefficient exceeds 0.5. Furthermore, the study computed the floor and ceiling effect (F/C) as well as the proportion of women who achieved the minimum and maximum scores. F/C effects refer to the percentage of individuals who achieve the highest (ceiling) or lowest (floor) possible scores within a specific domain. These effects serve as indicators of a questionnaire’s sensitivity and coverage at the extreme ends of the scale. In the context of this study, a problematic scenario is defined as a situation where 15% or more of the respondents fall into either the ceiling or floor category [ 42 ].

Responsiveness

Measurement instruments should possess a high degree of sensitivity to detect and accurately capture changes, while also demonstrating a responsive nature to promptly reflect these changes. According to the COSMIN checklist, responsiveness refers to the capacity of an HR-PRO instrument to accurately identify alterations in the construct being assessed over a while [ 29 ]. Terwee et al. [ 34 ] argue that responsiveness can be assessed by examining the relationship between the smallest detectable change (SDC) and the minimally important change (MIC). If the value of SDC is less than MIC, then the responsiveness is confirmed.

Interpretability

Interpretability refers to the extent of qualitative significance, specifically the minimally important changes (MIC) within the instrument. The extent to which quantitative instrument scores or changes in scores can be attributed to qualitative meaning, such as clinical or commonly understood meanings, has been discussed [ 29 ]. The estimation of the minimum important change (MIC) was conducted by dividing the standard deviation (SD) by two, as outlined in the study conducted by Norman et al. [ 43 ].

Descriptive characteristics of participants

This study involved the participation of 372 women. The participants were randomly split into two groups, one group of 172 participants for EFA and another group of 200 participants for CFA. The average age was 52.7 and 52.3 years with a standard deviation of 9.5 and 8.5 years in EFA and CFA group, respectively. A significant majority of the individuals surveyed were married (78.5%, 80.5% in EFA and CFA group, respectively), and 73.8%, 61.0% of them identified themselves as housewives in EFA and CFA group, respectively. Table  1 provides a summary of the additional socio-demographic characteristics of the two groups of participants.

In the present study, the mean (SD) of the entire BCPS scale was 61.66 (8.44), with a range of obtainable scores from 24 to 120. The mean (SD) for the six extracted factors, namely Perceived fear, Perceived knowledge, Perceived treatment belief, Perceived risk, Perceived need for a health check, and Perceived stigma, were respectively 8.28 (4.02), 11.63 (3.79), 10.5 (2.41), 9.37 (2.12), 10.19 (3.22), and 12.15 (2.65).

The values of Cronbach’s alpha and McDonald’s omega (95% CI) were found to be 0.68 and 0.70 (0.66 to 0.74), respectively. These results suggest that the questionnaire exhibits satisfactory internal consistency. Also, the ICC (95% CI) gave a value of 0.97 (0.94 to 0.99). Standard error of measurment is a statistical metric utilized to assess the accuracy and consistency of a given measurement. The SEM value in this study was determined to be 1.36. This implies that upon conducting multiple iterations of the measurement, it is anticipated that the recorded values will fall within a range of ± 1.36 units with the actual score. Moreover, the SDC denotes the smallest detectable change that can be consistently detected by the measuring apparatus. Within the given framework, the value of SDC was ascertained to be 3.73 units. This implies that any deviation in the measured quantity that is less than 3.73 units may not be discernible due to measurement errors and can be regarded as insignificant (Table  2 ).

The tool’s content and face validity were assessed using the CVI (CVI range: 0.87–1.00), CVR (CVR range: 0.75–1.00), and impact scores (3.06–4.00), which yielded values of 0.98, 0.95, and 3.70, respectively.

The construct validity investigation involved conducting an EFA on a set of 24 items. The resulting Kaiser-Meyer-Olkin (KMO) value of 0.71 was obtained at a statistically significant level of less than 0.001, indicating that the sample size in the current study was sufficient. Furthermore, the statistical analysis revealed that Bartlett’s test of sphericity yielded a significant result ( p  ≤ 0.001), indicating that factor analysis was appropriately conducted based on the correlation matrix in the sample under investigation.

The scree plot displayed the results of EFA, revealing six factors with eigenvalues > 1. These factors collectively accounted for 52% of the variance (Fig.  1 ). Table  3 presents the extracted components alongside the corresponding items associated with each factor. The initial factor examined in this study was Perceived fear, comprising a set of four questions that contributed to 13.60% of the overall variance. The second factor, referred to as Perceived knowledge, comprises a set of four questions that collectively account for 11.00% of the overall variance. The third and fourth factors were Perceived treatment belief and Perceived risk, respectively. These factors consisted of four and two questions, respectively, and accounted for 8.9% and 6.4% of the variance. The fifth factor, referred to as Perceived need for a health check, consists of four questions with a variance of 6.3%. Additionally, the sixth factor, known as “Perceived stigma,” comprises four questions that account for 5.8% of the total variance (Fig.  2 ). After preliminary psychometric testing, 24 items were factor-analysed providing a 22-item, six-factor scale. It is important to highlight that, in the original instrument, questions 9 and 23, respectively addressed the notions that “Breast cancer treatment does not change the outcome” and “The risk for breast cancer is higher in those with a family history of breast cancer,” were excluded from the EFA in the present study due to their factor loadings being less than 0.3. Consequently, there was a reduction in the number of instrument questions from 24 items in the original instrument to 22 items.

figure 1

Factor load scree plot of the items for determining the number of extracted factors of the Iranian version of BCPS

figure 2

Factor structure model of the BCPS based on CFA. All factor-item relationships were significant ( P  < 0.05). Fc1: Perceived fear, Fc2: Perceived knowledge, Fc3: Perceived treatment belief, Fc4: Perceived risk, Fc5: Perceived need for health check, Fc6: Perceived stigma

CFA was employed to examine the six factors that were derived from EFA. The findings indicate that the model has attained a level of fit that is considered optimal, thereby providing support for confirming the factor structure. The indicator \(\raisebox{1ex}{${x}^{2}$}\!\left/ \!\raisebox{-1ex}{$df$}\right.\) is found to be 2.029 (χ2 = 393.781, df = 194, P-value < 0.001). Additionally, the fit indexes TLI, CFI, NFI, and RFI all exceed the threshold of 0.9. Furthermore, the RMSEA and SRMR index values are both equal to 0.055, indicating a valid model.

Hypothesis testing, responsiveness and interpretability

The hypothesis confirmation involved the computation of Spearman’s correlation coefficient between the BCPS and QLICP-BR V2.0, and the resulting coefficient of 0.55 indicated a statistically significant correlation. To assess the feasibility of the tool, the ceiling effect in the overall score of BCPS was found to be 0.3%. In the sub-domains, the ceiling effects for Perceived fear, Perceived knowledge, Perceived treatment belief, Perceived risk, Perceived need for a health check, and Perceived stigma were determined to be 2.2%, 0.4%, 0.3%, 1.1%, 0.3%, and 0.8%, respectively. The floor effect in the overall score of BCPS was observed to be 0.5%, while in the specific subdomains, it was found to be 26.9%, 3.5%, 3.2%, 1.3%, 4.6%, and 0.8%, respectively. It is noteworthy to mention that the MIC refers to a specific threshold value that delineates the smallest alteration in the measured parameter that holds clinical or practical significance. In this particular instance, the MIC was determined to be 3.92 units. Specifically, the study reveals that the MIC value surpasses the SDC value by 3.73 units. This observation indicates that the Iranian version of the measurement tool is sufficiently responsive. Put simply, the measurement tool can accurately identify and assess changes that hold significance or relevance within the given measurement framework. SEM of this study’s findings generally implies that the measuring device used exhibits a satisfactory level of precision. The comparison between the SDC and the MIC values further demonstrates the instrument’s capacity to consistently identify significant variations in the measured variable (Table  2 ).

The aim of this study was to assess the measurment properties of the Breast Cancer Perception Scale (BCPS) in Iranian women, according to the COSMIN checklist for the first time. The findings of the research substantiate the validity, reliability, responsiveness, and interpretability of the BCPS, which is grounded in the health belief model (HBM) when applied to Iranian women.

The HBM has been widely employed in the study of breast cancer diagnostic behaviors for an extended period [ 25 , 26 , 27 ]. The BCPS is a novel screening tool for breast cancer that has been developed utilizing the HBM [ 23 ]. Despite the presence of various tools in this domain, such as “belief in mammography” and “breast self-examination,” “perceived sensitivity toward breast cancer,” “perceived benefits and obstacles of mammography usage,” “fear of breast cancer (FBC)”, and “fatalism regarding cancer “, these instruments appear to lack practicality as they assess factors individually and fail to encompass all relevant domains. The BCPS demonstrates utility in its comprehensive coverage of various domains, particularly in assessing previously unmeasured aspects such as perceived knowledge, and mental measurements, including the perceived need for a health check, perceived stigma, perceived fear, and perceived risk [ 23 ].

Breast cancer perception is one of the most important indicators for preventing breast cancer and adopting protective behaviors against breast cancer. Proper perception of breast cancer serves as a motivator for women to adhere to breast cancer prevention methods. Despite the existence of various preventive and diagnostic methods for breast cancer, none of these methods will be effective until there is a proper perception of breast cancer. Therefore, the BCPS scale, considering important dimensions such as Perceived knowledge, Perceived treatment belief, Perceived need for health check, Perceived stigma, Perceived fear and Perceived risk, can play an important role in creating preventive behaviors against breast cancer [ 23 ].

In the current investigation, EFA was conducted on a set of 24 items of the instrument. The analysis yielded six factors, namely Perceived fear, Perceived knowledge, Perceived treatment belief, Perceived risk, Perceived need for a health check, and Perceived stigma. These factors aligned with the original instrument and collectively accounted for approximately 52% of the variance, while in the original instrument, they accounted for 74.36% of the variance [ 23 ]. To assess the validity of the instrument, the KMO measure was computed, yielding a value of 0.71. Additionally, the adequacy of the model was verified through Bartlett’s test, which yielded a significance level of 0.77 in the original study [ 23 ]. Furthermore, the reliability of the instrument was obtained ranging from 0.64 to 0.94 by using Cronbach’s alpha, and these values align with the original study’s reported range of 0.81 to 0.95 [ 23 ].

In the present study, the initial factor extracted during the exploratory EFA was identified as perceived fear. The influence of perceived fear on women’s adoption of protective behaviors against breast cancer can be observed. The findings of various studies indicate that a significant majority of women encounter fear regarding the potential diagnosis of breast cancer and the subsequent possibility of undergoing a mastectomy, either unilaterally or bilaterally, at some point in their lives [ 44 , 45 ]. In a similar vein, a separate study indicated that women who exhibited a heightened FBC were found to undergo mammograms less frequently within a one-year timeframe in comparison to their counterparts [ 46 ]. In the study conducted by Aguirre et al., it was observed that young Spanish women exhibited a notable level of fear towards breast cancer, despite not expressing a general sense of concern regarding the disease. According to the study [ 47 ], it was found that 25.3% of the participants reported above-average FBC, while 59.7% reported high FBC. This finding implies that breast cancer may elicit a particularly strong sense of fear, even among young women who do not have significant health issues and have a low objective risk. This observation aligns with the findings of previous research [ 48 , 49 ]. Furthermore, when comparing the findings of this study to previous research conducted within the past two decades, it becomes evident that the extent of fear induced by breast cancer has remained relatively stable despite favorable epidemiological advancements such as reduced mortality rates and enhanced treatment options [ 50 ].

The second factor that was extracted in this study related to perceived knowledge. Perceived knowledge encompasses biases, such as unrealistic optimism and implicit confidence [ 51 ]. The concept of perceived knowledge pertains to an individual’s level of knowledge and is not directly associated with one’s knowledge, specifically regarding breast cancer. The WHO has advocated for the adoption of breast cancer knowledge and awareness as a viable medical strategy for the management of breast cancer. This approach is deemed essential and should be universally implemented, irrespective of financial constraints. In this regard, Izanloo et al. demonstrated that a significant majority of the participants, totaling over 84%, exhibited a lack of knowledge regarding breast cancer and screening tests among 14- to 84-year-old Iranian women. The primary factors cited by women as barriers to undergoing screening tests were the absence of discernible symptoms or issues and their perception of the test’s necessity. A significant difference was observed in the level of women’s knowledge of breast cancer and screening tests concerning factors such as employment status, education level, and family history of breast cancer. However, no significant difference was found in the level of knowledge among women based on their marital status or income level [ 52 ]. Moreover, a study conducted by Mehejabin et al. sought to examine the level of knowledge regarding various aspects of breast cancer among women in Bangladesh. The findings revealed that a majority of the participants, exceeding 50%, possessed a limited understanding of the risk factors associated with breast cancer, indicating a significant lack of knowledge [ 53 ].

Perceived treatment beliefs constituted an additional factor. Perceived belief in treatment can be influenced by various factors, including women’s spiritual and religious beliefs, familial history of breast cancer treatment, and prior experiences with breast cancer treatment [ 45 ]. Concerning this matter, individuals’ perceptions of their treatment beliefs have the potential to influence their engagement in protective behaviors. The findings of the study conducted by Mehejabin et al. indicate that a considerable proportion of women hold the belief that breast cancer can be detected at a young age. Furthermore, the participants held the belief that early diagnosis of the disease could lead to its potential cure [ 53 ]. The aforementioned findings align with the results obtained from a research study carried out at Dhaka Medical College Hospital in Bangladesh, wherein 51.43% of female participants indicated that early detection of breast cancer leads to a potential cure [ 54 ]. Suwankhong and Liamputtong have posited that religious belief significantly influences individuals’ decision-making processes concerning treatment options and risk factors associated with breast cancer [ 55 ]. Yew et al., found a significant difference in the perceptions of breast cancer risk with religious affiliations, specifically between the Muslim and Buddhist cohorts. The impact of Islam and Buddhism on individuals’ lifestyles and health-related behaviors has been significant. Muslim women exhibited a profound conviction in the authority of God (Allah), whereas Buddhist women commonly invoked their karma [ 56 ].

Perceived risk, identified as an additional extracted factor, holds a significant influence over breast cancer protective behavior [ 45 ]. Observing the challenges and distress experienced by our beloved individuals throughout breast cancer treatment amplifies both the perceived fear and the perceived risk of breast cancer [ 57 ]. The primary determinant of health behaviors for breast cancer prevention, diagnosis, and control is the perceived risk. Conversely, establishing concordance between the perceived risk and the objective risk of developing breast cancer results in a more accurate and actual perception of the risk. Consequently, it can serve as motivation for fostering suitable health behaviors [ 58 ]. Hajian et al. [ 59 ], examined the perceived risk of breast cancer among 800 Iranian women about the actual risk. The findings of the study revealed that both women with a low and high risk of breast cancer exhibited a significantly higher perceived risk of the disease compared to their actual risk. This finding suggests a significant inclination towards pessimism in the assessment of breast cancer risk, consistent with previous research conducted in this domain [ 60 , 61 ].

The results of this study also identified the perceived need for a health check as an additional extracted factor. One of the main obstacles to breast cancer screening among women is a diminished perception of the necessity for health screening. Women typically do not perceive the necessity of seeking medical attention unless they possess knowledge regarding the specific symptoms associated with a particular ailment [ 62 ]. Research findings indicate that women residing in developing nations often tend to decline the notion of early diagnosis and screening for breast cancer, primarily influenced by their cultural and personal beliefs. The aforementioned factor has a detrimental impact on the implementation of preventive measures aimed at mitigating the risk of breast cancer [ 63 ]. Therefore, variations in the perceived need for health screening can potentially impact individuals’ engagement in breast cancer protective behaviors. The scoping review study conducted by Omidi et al. examined the current status of breast cancer screening strategies and indicators among Iranian women. The study findings revealed that the prevalence rates of screening methods, including BSE, CBE, and mammography, among Iranian women were reported as 0-79.4%, 4.1–41.1%, and 1.3–45%, respectively [ 64 ].

Based on the HBM theory, Darvishpour et al. [ 25 ] posited that the decision of women to engage in breast cancer screening is influenced by factors such as self-efficacy and perceived benefits. Conversely, the presence of perceived barriers diminishes the likelihood of self-examination. According to Khazir et al., individuals who perceive fewer barriers are more likely to engage in breast cancer screening programs [ 65 ]. Abdel-Aziz et al. conducted a study utilizing EFA to examine the perceived barriers faced by women with breast cancer. Their findings indicate that personal fears, specifically fear of doctors/examiners, fear of screening results, and fear of the hospital environment, are the primary obstacles preventing women from utilizing free screening. These fears were identified as the main barriers based on their eigenvalue values, which exceeded 3.335, representing 30.4% of the barriers identified [ 66 ].

The final factor that was extracted pertains to the concept of perceived stigma. The symbolic significance of breasts for women stems from their association with childbirth, breastfeeding, childrearing, and sexual desires. Consequently, the symbolic significance associated with this phenomenon may impede women from accessing necessary healthcare services, interventions, or diagnostic procedures [ 67 ]. Furthermore, the absence of discussion regarding breast cancer and screening behaviors may be associated with societal stigmatization and cultural taboos surrounding the topic of breasts [ 68 ]. It is well known that stigma plays a significant role in the psychological distress that breast cancer-diagnosed women experience. The occurrence of rejection, blame, or devaluation is what defines the social phenomenon known as stigma. This arises from the personal experience, perception, or rational expectation of an unfavorable social evaluation directed toward an individual or a collective entity [ 69 ]. It was found that around 76.7% and 8.7% of breast cancer survivors reported moderate and high levels of stigma, respectively [ 70 ]. Based on prior research, it has been established that the perceived stigma among individuals diagnosed with breast cancer has significant adverse consequences for their overall well-being and health-related outcomes. These repercussions encompass various aspects such as sexual dysfunction, depressive symptoms, compromised sleep quality, reduced inclination to seek medical assistance, and diminished quality of life [ 71 ].

In terms of clinical application, the use of this scale is considered to save time and enable early detection of breast cancer during assessment. Utilizing this screeninig tool by health care providers improves a quick and comprehensive attitude toward breast cancer perception among women. The main advantage of BCPS is that it helps more subjective measurements compared to other scales in this area. In addition, its goal is to evaluate the relationship between breast cancer and diagnostic behaviors for breast cancer (such as maintaining healthy behaviors like diet, physical activity, mammography, breast self-examination, and clinical breast examination), knowledge about breast cancer, and family history of breast cancer [ 23 ].

Strength and limitation

The present study possesses several notable strengths. Firstly, it is the first study to assess BCPS among Iranian women. Secondly, the study adheres to the COSMIN checklist, ensuring methodological rigor. Additionally, the study incorporates both the DP and FB methods for the translation process, effectively addressing the limitations associated with the FB method. Lastly, the study includes a comparative analysis of BCPS with other versions.

However, it is important to acknowledge the limitations of the current study. These limitations include the lack of criterion validity calculations due to the lack of a gold standard, the lack of an assessment of cross-cultural validity and the possibility of bias from participants’ tendency to give socially desirable answers when using self-reported measures. As we conducted this study in Tabriz-Iran, should be cautious about the generalizability of findings. In conclusion, it is recommended that future research endeavors employ a larger sample sizeand assess the measurment properties in diverse contexts.

The obtained findings suggest that the Iranian version of the BCPS demonstrates satisfactory measurment properties for assessing the perception of breast cancer among Iranian women. Furthermore, it exhibits favorable responsiveness to clinical variations. The assessment of women’s perceptions of breast cancer is imperative for the advancement of preventive behaviors against this disease. The present scale can be employed for the assessment of the association between breast cancer and behaviors related to breast cancer diagnosis, including breast self-examination, clinical breast examination, mammography, and the adoption of healthy behaviors such as diet and exercise. Finally, it can be utilized to investigate the correlation between breast cancer knowledge and family history.

Data availability

The datasets generated and/or analyzed during the current study are not publicly available due to the limitations of ethical approval involving the patient data and anonymity, but are available from the corresponding author upon reasonable requests.

Abbreviations

Breast Cancer Perception Scale

Consensus-Based Standards for the Selection of Health Status Measurement Instruments

Breast Cancer

Global Cancer Incidence Mortality and Prevalence

World health organization

Eastern Mediterranean Region

Age-standardized rate

Hormone replacement therapy

Visual Analouge Scale

Health Belief Model

Breast self-examination

Clinical Breast Examination

Health related-patient reported outcomes

Dual pannel

Forward-backward

Exploratory factor analysis

Confirmatory factor analysis

Intra-class Correlation Coefficient

standard deviation

Content validity index

Content validity ratio

Standard error of measurment

Smallest detectable change

Minimal important change

Degree of freedom

Average variance extracted

Kaiser-Meyer-Olkin

Root mean squared error of approximation

Comparative fit index

Tucker–Lewis index

Ferlay J, Soerjomataram I, Ervik M, Dikshit S, Eser C, Mathers M, et al. V1. 0, cancer incidence and mortality worldwide: IARC CancerBase No. 11. 2013. Lyon, France: International Agency for Research on Cancer; 2012.

Google Scholar  

Fitzmaurice C, Dicker D, Pain A, Hamavid H, Moradi-Lakeh M, MacIntyre MF, et al. The global burden of Cancer 2013. JAMA Oncol. 2015;1(4):505–27.

Article   PubMed   Google Scholar  

Parkin DM, Läärä E, Muir CS. Estimates of the worldwide frequency of sixteen major cancers in 1980. Int J Cancer. 1988;41(2):184–97.

Article   CAS   PubMed   Google Scholar  

Sung H, Ferlay J, Siegel RL, Global Cancer S. 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA Cancer J Clin. 2021;71(3):209 – 49.

Sedeta ET, Jobre B, Avezbakiyev B. Breast cancer: global patterns of incidence, mortality, and trends. J Clin Oncol. 2023;41(16).

Soerjomataram I, Bray F. Planning for tomorrow: global cancer incidence and the role of prevention 2020–2070. Nat Rev Clin Oncol. 2021;18(10):663–72.

Siegel RL, Giaquinto AN. Cancer statistics, 2024. CA Cancer J Clin. 2024;74(1):12–49.

Hortobagyi GN, de la Garza Salazar J, Pritchard K, Amadori D, Haidinger R, Hudis CA, et al. The global breast cancer burden: variations in epidemiology and survival. Clin Breast Cancer. 2005;6(5):391–401.

Sharif Nia H, Behmanesh F, Kwok C, Firouzbakht M, Ebadi A, Nikpour M. Breast cancer screening beliefs questionnaire: psychometric properties of the Persian version. BMC Womens Health. 2020;20(1):184.

Article   PubMed   PubMed Central   Google Scholar  

Akbari M, Abachizadeh K, Tabatabaei S, Ghanbari Motlagh A, Majd Jabari Z, Khayamzadeh M. Cancer in Iran. Qom: Darolfekr; 2008.

Razi S, Salehiniya H, Dizaji M. Epidemiology of prevalent cancer among Iranian women and its incidence trends from 2003–2009 in Iran. AMUJ. 2015;18(2):17–24.

Kazeminia M, Salari N, Hosseinian-Far A, Akbari H, Bazrafshan M-R, Mohammadi M. The prevalence of breast cancer in Iranian women: a systematic review and meta-analysis. Indian J Gynecol Oncol. 2022;20(1):14.

Article   Google Scholar  

Giaquinto AN, Sung H, Miller KD, Kramer JL, Newman LA, Minihan A, et al. Breast cancer statistics, 2022. CA Cancer J Clin. 2022;72(6):524–41.

Shamshirian A, Heydari K. Breast cancer risk factors in Iran: a systematic review & meta-analysis. Horm Mol Biol Clin Investig. 2020;41(4).

Weinstein ND. What does it mean to understand a risk? Evaluating risk comprehension. J Natl Cancer Inst Monogr. 1999(25):15–20.

Osei E, Osei Afriyie S, Oppong S, Ampofo E, Amu H. Perceived breast Cancer risk among female undergraduate students in Ghana: a cross-sectional study. J Oncol. 2021;2021:8811353.

Glanz K, Rimer BK, Viswanath K. Health behavior and health education: theory, research, and practice. Wiley; 2008.

Rakhshkhorshid M, Navaee M. The Association of Health Literacy with breast Cancer knowledge, perception and screening behavior. Eur J Breast Health. 2018;14(3):144–7.

PubMed   PubMed Central   Google Scholar  

Fletcher SW, Elmore JG. Clinical practice. Mammographic screening for breast cancer. N Engl J Med. 2003;348(17):1672–80.

Vetter L, Keller M, Bruckner T, Golatta M, Eismann S, Evers C, et al. Adherence to the breast cancer surveillance program for women at risk for familial breast and ovarian cancer versus overscreening: a monocenter study in Germany. Breast Cancer Res Treat. 2016;156(2):289–99.

Brinton LA, Brown SL, Colton T, Burich MC, Lubin J. Characteristics of a population of women with breast implants compared with women seeking other types of plastic surgery. Plast Reconstr Surg. 2000;105(3):919–27. discussion 28 – 9.

Gail MH, Brinton LA, Byar DP, Corle DK, Green SB, Schairer C, et al. Projecting individualized probabilities of developing breast cancer for white females who are being examined annually. J Natl Cancer Inst. 1989;81(24):1879–86.

Taylan S, Özkan İ. Breast Cancer Perception Scale: Psychometric Development Study. Eur J Breast Health. 2021;17(2):95–102.

Becker MH. The health belief model and personal health behavior. Health Educ Monogr. 1974;2:324–473.

Darvishpour A, Vajari SM, Noroozi S. Can Health Belief Model predict breast Cancer screening behaviors? Open Access Maced J Med Sci. 2018;6(5):949–53.

Fouladi N, Pourfarzi F, Mazaheri E, Asl HA, Rezaie M, Amani F, et al. Beliefs and behaviors of breast cancer screening in women referring to health care centers in northwest Iran according to the champion health belief model scale. Asian Pac J Cancer Prev. 2013;14(11):6857–62.

MASOUDI YL, Dashtbozorgi B, Gheibizadeh M, SAKI MA, Moradi M. Applying the health belief model in predicting breast cancer screening behavior of women. Jundishapur J Chronic Dis Care. 2015;4(4):e30234.

Mokkink LB, Terwee CB, Patrick DL, Alonso J, Stratford PW, Knol DL, et al. The COSMIN checklist for assessing the methodological quality of studies on measurement properties of health status measurement instruments: an international Delphi study. Qual Life Res. 2010;19(4):539–49.

Mokkink LB, Terwee CB, Patrick DL, Alonso J, Stratford PW, Knol DL, et al. COSMIN checklist manual. Amsterdam: University Medical Center; 2012.

World Health Organization. Process of translation and adaptation of instruments. 2010.

Kalfoss M. Translation and adaption of questionnaires: a nursing challenge. SAGE Open Nurs. 2019;5:2377960818816810.

Comrey A, Lee H. A first course in factor analysis: psychology press. Taylor and Francis Group New York, NY; 2013.

Nunnally JC, Bernstein IH. Psychometric theory. 3rd ed. New York: McGrawHill; 1994.

Terwee CB, Bot SD, de Boer MR, van der Windt DA, Knol DL, Dekker J, et al. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007;60(1):34–42.

Seyf AA. Measurement, test and educational evaluation. 7th ed. Tehran: Douran; 2016.

Yang Z, Sun Y, Wang H, Zhang C, Wang A. A scale for measuring home-based cardiac rehabilitation exercise adherence: a development and validation study. BMC Nurs. 2023;22(1):259.

Lawshe CH. A quantitative approach to content validity. Pers Psychol. 1975;28(4):563–75.

Waltz CF, Bausell BR. Nursing research: design statistics and computer analysis. Davis Fa; 1981.

Polit DF, Beck CT, Owen SV. Is the CVI an acceptable indicator of content validity? Appraisal and recommendations. Res Nurs Health. 2007;30(4):459–67.

Harerimana A, Mtshali NG. Using exploratory and Confirmatory Factor Analysis to understand the role of technology in nursing education. Nurse Educ Today. 2020;92:104490.

Schreiber J, Nora A, Stage F, Barlow L, King J. Confirmatory factor analyses and structural equations modeling: an introduction and review. J Educ Res. 2006;99(6).

Gulledge CM, Smith DG, Ziedas A, Muh SJ, Moutzouros V, Makhni EC. Floor and Ceiling effects, Time to Completion, and question burden of PROMIS CAT domains among shoulder and knee patients undergoing nonoperative and operative treatment. JB JS Open Access. 2019;4(4).

Norman GR, Sloan JA, Wyrwich KW. Interpretation of changes in health-related quality of life: the remarkable universality of half a standard deviation. Med Care. 2003;41(5):582–92.

George TO, Allo TA, Amoo EO, Olonade O. Knowledge and attitudes about breast Cancer among women: a Wake-Up call in Nigeria. Open Access Maced J Med Sci. 2019;7(10):1700–5.

Kardan-Souraki M, Moosazadeh M, Khani S, Hamzehgardeshi Z. Factors related to breast Cancer screening in women in the Northern Part of Iran: a cross-sectional study. Open Access Maced J Med Sci. 2019;7(4):637–42.

Sohbet R, Karasu F. Investigation of the knowledge, behavior and applications of their women towards breast cancer. Gumushane Univ J Health Sci. 2017;6(4):113–21.

Aguirre-Camacho A, Hidalgo B, González-Cuevas G. Fear of breast cancer among young Spanish women: factor structure and psychometric properties of the Champion breast cancer fear scale. PLoS ONE. 2021;16(4):e0249562.

Article   CAS   PubMed   PubMed Central   Google Scholar  

Berry TR, Curtin KD, Courneya KS, McGannon KR, Norris CM, Rodgers WM, et al. Heart disease and breast cancer perceptions: ethnic differences and relationship to attentional bias. Health Psychol Open. 2016;3(2):2055102916657673.

Vraga EK, Stefanidis A, Lamprianidis G, Croitoru A. Cancer and Social Media: a comparison of traffic about breast Cancer, prostate Cancer, and other Reproductive cancers on Twitter and Instagram. J Health Commun. 2018;23(2):181–9.

Torre LA, Siegel RL, Ward EM, Jemal A. Global Cancer Incidence and Mortality Rates and Trends–An Update. Cancer Epidemiol Biomarkers Prev. 2016;25(1):16–27.

Pohl R, Pohl RF. Cognitive illusions: a handbook on fallacies and biases in thinking. judgement and memory: Psychology; 2004.

Izanloo A, Ghaffarzadehgan K, Khoshroo F, Erfani Haghiri M, Izanloo S, Samiee M, et al. Knowledge and attitude of women regarding breast cancer screening tests in Eastern Iran. Ecancermedicalscience. 2018;12:806.

Mehejabin F, Rahman MS. Knowledge and perception of breast cancer among women of reproductive age in Chattogram, Bangladesh: a cross-sectional survey. Health Sci Rep. 2022;5(5):e840.

Chowdhury S, Sultana S. Awareness on breast cancer among the women of reproductive age. JFRH. 2011;5(4).

Suwankhong D, Liamputtong P. Experiences of living with HIV: social stigma and discrimination among HIV-positive mothers in Southern Thailand. Malays J Med Health Sci. 2021;17:36–42.

Yew WC, Awang AH, Selvadurai S, Mohd Noor M, Chang PK. A Comparative Study of Islam and Buddhism: a Multicultural Society Perspective. Religions. 2021;12(12):1098.

Hashemi-Ghasemabadi M, Taleghani F, Kohan S, Yousefy A. Living under a cloud of threat: the experience of Iranian female caregivers with a first-degree relative with breast cancer. Psychooncology. 2017;26(5):625–31.

Katapodi MC, Dodd MJ, Facione NC, Humphreys JC, Lee KA. Why some women have an optimistic or a pessimistic bias about their breast cancer risk: experiences, heuristics, and knowledge of risk factors. Cancer Nurs. 2010;33(1):64–73.

Hajian-Tilaki K, Nikpour M. Accuracy of self-perceived risk perception of breast cancer development in Iranian women. BMC Womens Health. 2021;21(1):93.

Aduayi VA, Onayade AA, Aduayi OS, Odu OO. Relating self-perceived with objective risk for breast cancer among women in rural and urban communities in southwestern Nigeria. Int J Med Sci Public Health. 2015;4(10):1.

Ceber E, Soyer MT, Ciceklioglu M, Cimat S. Breast cancer risk assessment and risk perception on nurses and midwives in Bornova Health District in Turkey. Cancer Nurs. 2006;29(3):244–9.

Kwok C, Endrawes G, Lee CF. Cultural beliefs and attitudes about breast Cancer and Screening practices among Arabic women in Australia. Cancer Nurs. 2016;39(5):367–74.

Donnelly TT, Khater AH, Al-Bader SB, Al Kuwari MG, Al-Meer N, Malik M, et al. Arab women’s breast cancer screening practices: a literature review. Asian Pac J Cancer Prev. 2013;14(8):4519–28.

Omidi Z, Koosha M, Nazeri N, Khosravi N, Zolfaghari S, Haghighat S. Status of breast cancer screening strategies and indicators in Iran: a scoping review. J Res Med Sci. 2022;27:21.

Khazir Z, Morrowati Sharifabad MA, Vaezi AA, Enjezab B, Fallahzadeh H, Yari F. Predictors of mammography based on Health Belief Model in Khorramabad women. J Educ Health Promot. 2019;8:180.

Abdel-Aziz SB, Amin TT, Al-Gadeeb MB, Alhassar AI, Al-Ramadan A, Al-Helal M, et al. Perceived barriers to breast cancer screening among Saudi women at primary care setting. J Prev Med Hyg. 2018;59(1):E20–9.

CAS   PubMed   PubMed Central   Google Scholar  

Banning M, Hafeez H. A two-center study of muslim women’s views of breast cancer and breast health practices in Pakistan and the UK. J Cancer Educ. 2010;25(3):349–53.

Sheppard VB, Christopher J, Nwabukwu I. Breaking the silence barrier: opportunities to address breast cancer in African-born women. J Natl Med Assoc. 2010;102(6):461–8.

PubMed   Google Scholar  

Lebel S, Devins GM. Stigma in cancer patients whose behavior may have contributed to their disease. Future Oncol. 2008;4(5):717–33.

Jin R, Xie T, Zhang L, Gong N, Zhang J. Stigma and its influencing factors among breast cancer survivors in China: a cross-sectional study. Eur J Oncol Nurs. 2021;52:101972.

Wang Q-X, Bai Y, Lu G-F, Zhang C-Y. Perceived health-related stigma among patients with breast cancer. Chin Nurs Res. 2017;4(4):158–61.

Download references

Acknowledgements

We should thank the Vice-chancellor for Research of Tabriz University of Medical Sciences for their financial support and the invaluable participation of women would be appreciated.

This Study is funded by Tabriz University of Medical Sciences (grant number: 69930). The funding source had no role in the design and conduct of the study, and decision to this manuscript writing and submission.

Author information

Authors and affiliations.

Students Research Committee, Midwifery Department, Faculty of Nursing and Midwifery, Tabriz University of Medical sciences, Tabriz, Iran

Sepideh Mashayekh-Amiri

Cabrini Research, Cabrini Health, Melbourne, VIC, 3144, Australia

Mohammad Asghari Jafarabadi

School of Public Health and Preventative Medicine, Faculty of Medicine, Nursing and Health Sciences, Monash University, VIC 3800, Melbourne, Australia

Road Traffic Injury Research Center, Tabriz University of Medical Sciences, Tabriz, Iran

Department of Community Health Nursing, Nursing and Midwifery Faculty, Tabriz University of Medical Sciences, Tabriz, Iran

Mina Hosseinzadeh

Department of Community Health Nursing, Faculty of Nursing and Midwifery, Tabriz University of Medical sciences, Tabriz, Iran

Elham seyed Kanani

Social Determinants of Health Research Center, Department of Midwifery, Faculty of Nursing and Midwifery, Tabriz University of Medical Sciences, Tabriz, Iran

Mojgan Mirghafourvand

Medical Philosophy and History Reseach Center, Tabriz University of Medical Sciences, Tabriz, Iran

You can also search for this author in PubMed   Google Scholar

Contributions

MM, SMA, MH contributed to the design of the study. SMA, ESK, MM has written the first draft of this article and MAJ Analyzed and data. All authors have critically read the text and contributed with inputs and revisions, and all authors read and approved the final manuscript.

Corresponding author

Correspondence to Mojgan Mirghafourvand .

Ethics declarations

Ethics approval and consent to participate.

The current study was approved by the Ethics Committee of Tabriz University of Medical Sciences [ref: IR.TBZMED.REC.1401.390]. Written Informed consent to participate in the study was obtained from all the participants before enrolment. Permission to use the BCPS was obtained by sending an email from the original designer of the instrument. All methods were carried out in accordance with relevant guidelines and regulations.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ . The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/ ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article.

Mashayekh-Amiri, S., Jafarabadi, M.A., Hosseinzadeh, M. et al. Measurement properties of the Iranian version of the breast cancer perception scale (BCPS) according to the COSMIN checklist. BMC Cancer 24 , 743 (2024). https://doi.org/10.1186/s12885-024-12493-2

Download citation

Received : 16 July 2023

Accepted : 10 June 2024

Published : 18 June 2024

DOI : https://doi.org/10.1186/s12885-024-12493-2

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Breast cancer
  • Psychometric
  • Patient-reported outcome (PRO)
  • COSMIN checklist
  • Iranian validation

ISSN: 1471-2407

validity reliability and generalizability in qualitative research pdf

VIDEO

  1. Validity and Reliability in Research

  2. Validity and Reliability in Mixed-Methods Research

  3. Validity and Reliability in Research

  4. Objectivity, Validity and Reliability in Qualitative Research

  5. Validity and Reliability in Quantitative Research (A LECTURE IN URDU/اُردو میں لیکچر)

  6. Reliability, Validity, & Generalizability (7:03)

COMMENTS

  1. (PDF) Validity, reliability, and generalizability in qualitative research

    Same for quantitati ve research, quality for qu alitative research can be assessed in terms of validity, reliability, and g eneralizability. Validity. Validity in qualitative research means ...

  2. Validity, reliability, and generalizability in qualitative research

    Validity, reliability, and generalizability in qualitative research. In general practice, qualitative research contributes as significantly as quantitative research, in particular regarding psycho-social aspects of patient-care, health services provision, policy setting, and health administrations. In contrast to quantitative research ...

  3. (PDF) Validity and Reliability in Qualitative Research

    Validity and reliability or trustworthiness are fundamental issues in scientific research whether qualitative, quantitative, or mixed research. It is a necessity for researchers to describe which ...

  4. (PDF) Validity, Reliability, Generalizability

    1. Learning Outcome. The present module is designed to give the students a thorough under standing of the. key research concepts of validity, reliability and generalizability. By the end of the ...

  5. Validity, reliability, and generalizability in qualitative research

    Fundamental concepts of validity, reliability, and generalizability as applicable to qualitative research are then addressed with an update on the current views and controversies. Keywords: Controversies; generalizability; primary care research; qualitative research; reliability; validity. In general practice, qualitative research contributes ...

  6. PDF Issues of validity and reliability in qualitative research

    the validity and reliability of quantitative research cannot be applied to qualitative research, there are ongoing debates about whether terms such as validity, reliability and generalisability are appropriate to evalu-ate qualitative research.2-4 In the broadest context these terms are applicable, with validity referring to the integ-

  7. Validity, reliability, and generalizability in qualitative research

    Validity, reliability, and generalizability in qualitative research. This article illustrates with five published studies how qualitative research can impact and reshape the discipline of primary care, spiraling out from clinic-based health screening to community-based disease monitoring, evaluation of out-of-hours triage services to provincial ...

  8. Series: Practical guidance to qualitative research. Part 4

    ity, generalizability, reliability, and objectivity, are not suitable to judge the quality of qualitative research. Qualitative researchers speak of trustworthiness, which simply poses the question 'Can the findings to be trusted?' [4]. Several definitions and criteria of trust-worthiness exist (see Box 1)[2], but the best-known

  9. PDF Quality in Qualitative Studies: The Case of Validity, Reliability and

    validity, reliability and generalizability in qualitative research, this paper provides a review of relevant literature of recent years. To check on the extent of their reporting in research papers, a total of 15 accounting research recently published in top-ranked accounting journals have been selected.

  10. Issues of validity and reliability in qualitative research

    Although the tests and measures used to establish the validity and reliability of quantitative research cannot be applied to qualitative research, there are ongoing debates about whether terms such as validity, reliability and generalisability are appropriate to evaluate qualitative research.2-4 In the broadest context these terms are applicable, with validity referring to the integrity and ...

  11. Contextualizing reliability and validity in qualitative research

    Rather than prescribing what reliability and/or validity should look like, researchers should attend to the overall trustworthiness of qualitative research by more directly addressing issues associated with reliability and/or validity, as aligned with larger issues of ontological, epistemological, and paradigmatic affiliation.

  12. Generalizability in Qualitative Research: A Tale of Two Traditions

    Generalizability in qualitative research has been a controversial topic given that interpretivist scholars have resisted the dominant role and mandate of the positivist tradition within social sciences. Aiming to find universal laws, the positivist paradigm has made generalizability a crucial criterion for evaluating the rigor of quantitative ...

  13. (PDF) Quality in qualitative research: Through the lens of validity

    Fundamental concepts of validity, reliability, and generalizability as applicable to qualitative research are then addressed with an update on the current views and controversies. View full-text ...

  14. Validity, reliability, and generalizability in qualitative research

    Fundamental concepts of validity, reliability, and generalizability as applicable to qualitative research are then addressed with an update on the current views and controversies. Free full text J Family Med Prim Care . 2015 Jul-Sep; 4(3): 324-327.

  15. [PDF] Understanding Reliability and Validity in Qualitative Research

    The use of reliability and validity are common in quantitative research and now it is reconsidered in the qualitative research paradigm. Since reliability and validity are rooted in positivist perspective then they should be redefined for their use in a naturalistic approach. Like reliability and validity as used in quantitative research are providing springboard to examine what these two ...

  16. Promoting Rigorous Research: Generalizability and Qualitative Research

    First, we describe types of generalizability, the use of trustworthiness criteria, and strategies for maximizing generalizability within and across studies, then we discuss how the research approaches of grounded theory, autoethnography, content analysis, and metasynthesis can yield greater generalizability of findings.

  17. PDF Understanding and Using Trustworthiness in Qualitative Research

    oriented qualitative research, it is also productive to seek understanding from others' systematic qualitative inquiry. It is with such intentions that an analogy to both external validity and generalizability in quantitative research might be productive. As with quantitative research, qualitative inquiry seeks to expand

  18. PDF Validity, reliability, and generalizability in qualitative research

    quality for qualitative research can be assessed in terms of validity, reliability, and generalizability. Validity Validity in qualitative research means "appropriateness" of the tools, processes, and data. Whether the research question is valid for the desired outcome, the choice of methodology is appropriate

  19. LWW

    LWW

  20. Understanding Reliability and Validity in Qualitative Research

    To widen the spectrum of conceptualization of reliability and revealing the congruence of. reliability and validity in qualitative research, Lincoln and Guba (1985) states that: "Since there. can ...

  21. Validity, Reliability, and Generalizability in Qualitative

    Validity, Reliability, And Generalizability in Qualitative (3) - Free download as PDF File (.pdf), Text File (.txt) or read online for free. Metodologia Cualitiativa

  22. PDF Assessment, Evaluation and Research Relationships and Definitions in

    funding. Reliability, validity, trustworthiness. Role of the researcher Knowledgeable, accurate, ethical Active, including making recommendations "Not to say what is to be done but rather to say what has been done" (Campbell, 1991). Quantitative studies: theoretically non-existent; instrument and design replicable. Qualitative

  23. (PDF) Quality in Qualitative Studies: The Case of Validity, Reliability

    validity, reliabilit y and generalizability in qualitative research, this paper provides a review o f. relevant literature of recent years. To check on the extent of their reporting in research ...

  24. Measurement properties of the Iranian version of the breast cancer

    The researchers conducted an assessment of face validity using both qualitative and quantitative methods. To conduct a qualitative assessment of face validity, a sample of 10 women from health centers in Tabriz City was selected using a convenience sampling method. This sample then examined the initial questionnaire.

  25. (PDF) Generalizability in Qualitative Research: A Tale of Two Traditions

    'reliability', 'validity' and 'rigor', could be still used in interpretivist qualitative research and just redefined in terms of meaning (Guba & Lincoln, 1981; Morse et al., 2002). For ...