2011年10月27日 星期四

內部一致性 20120131更新

Internal consistency reveals the correlation or consistency of the items in the same domain in a questionnaire. It is often revealed by Cronbach's α.

中文解釋:
內部一致性是指問卷中同一面向的題目之間的相關性或一致性,通常以Cronbach's α表示。

判斷標準:Cronbach's α 介於0.7-0.9為內部一致性良好,α>0.95可能表示有些題目概念重複,可考慮刪除。 

臨床意義:

內部一致性越高,表示問卷各題所測量的概念之相關性越高。

限制:
Cronbach's α的數值易受問卷的題數影響。題數越多,Cronbach's α值越大。

研究設計:
邀請問卷之適用對象填寫問卷,再依據填寫結果計算問卷每個面向的Cronbach's α。

參考資料:

Cronbach, Lee. (1951). Coefficient alpha and the internal structure of tests. Psychometrika, 16, 297-334.

Rush, A. J., Trivedi, M. H., Ibrahim, H. M., Carmody, T. J., Arnow, B., Klein, D. N., . . . Keller, M. B. (2003). The 16-Item Quick Inventory of Depressive Symptomatology (QIDS), clinician rating (QIDS-C), and self-report (QIDS-SR): a psychometric evaluation in patients with chronic major depression. Biological Psychiatry, 54, 573-583.

Terwee, C. B., Bot, S. D. M., de Boer, M. R., van der Windt, Dawm, Knol, D. L., Dekker, J., . . . de Vet, H. C. W. (2007). Quality criteria were proposed for measurement properties of health status questionnaires. Journal of Clinical Epidemiology, 60, 34-42.

2011年10月26日 星期三

外在反應性

External responsiveness reflects the extent to which changes in a measure over a specified time frame relate to corresponding changes in a reference measure of health status.

Reference:
Husted JA, Cook RJ, Farewell VT, Gladman DD. Methods for assessing responsiveness: a critical review and recommendations. J Clin Epidemiol 2000; 53: 459-468.

外在反應性:可反映出某一評估工具於一特定時限後所評得的改變,與某一評估健康狀況的參考評估工具所評得的改變之間關聯的程度。

研究設計: 追蹤研究/重複評估。如: 入,出院時某一評估工具與參考評估工具皆各評估一次。

標準:以與參考評估工具之間的相關係數來驗證,Pearson's r/Spearman's ρ:≧0.75 良好;0.40-0.74 中等;≦0.40 差。

Model Paper:
Hsueh I, Mao H, Huang H, Hsieh C. Clinical applications of balance measures in stroke patients. Formosan Journal of Medicine 2001;5:261-268.

臨床意義:臨床上,病患在日常生活功能上的進步,常被臨床人員視為一個重要的改變。所以,我們有時會以評估日常生活功能的工具為參考架構,來驗證一個新的評估工具是否有能力偵測到臨床上重要的改變,即驗證這個新的評估工具的外在反應性。

同時效度-renew 1101

 Concurrent validity is “studies when the measurement to be validated and is measured with the previously validated measure at relatively the same time (concurrently), to see how well both measures are correlated. "

Portney LG, Watkins MP. Foundations of clinical research: Applications to practice. Upper Saddle River: Pearson Prentice Hall; 2009.
Hobart, J. C., Lamping D. L., & Thompson, A. J. (1996). Evaluating neurological outcome measures: The bare essentials. J Neurol Neurosurg Psychiatry, 60, 127-130.

同時效度 (concurrent validity) 是指評估工具的評估結果與目前公認之黃金標準的評估工具,評估結果之關聯程度,以瞭解評估工具是否評估到與公認標準之評估工具相同之建構,可檢驗評估工具結果的正確性。

判斷標準: 可使用Pearson’s r 來檢驗兩評估工具之相關。r 0.6 為具有良好同時效度之基本標準。
Salter K, Jutai JW, Teasell R, Foley NC, Bitensky J, Bayley M. Issues for selection of outcome measures in stroke rehabilitation: ICF activity. Disabil Rehabil 2005; 27: 315-340.

研究設計: 個案同時接受欲驗證的評估工具及效標工具之評估
臨床意義: 臨床上,可瞭解並確認如新發展之日常生活量表(具簡短,快速及全面之特性)與目前大家所公認使用之日常生活量表比較,是否也能有效地評估到相同之建構 (e.g.,日常生活功能)。如確認可評量到相同之建構,也許可考量使用新量表,以提升評估之效率。

Test-retest reliability

Test-retest reliability is the stability of a measuring instrument over time. It is assessed by giving the measure to the same subjects on two different occasions and examining the correlation between the two scores.

Reference: Hobart, J. C., Lamping D. L., & Thompson, A. J. (1996). Evaluating neurological outcome measures: The bare essentials. J Neurol Neurosurg Psychiatry, 60, 127-130.

中文解釋:再測信度指評估工具在不同時間測得結果之穩定性。再測信度的驗證方法為在不同時間下重複使用同一評估工具於相同的個案,計算不同時間測得分數之關聯性。

判斷標準:使用intraclass correlation coefficient (ICC)估計
                    0.39 poor reliability;
                    0.40-0.59 moderate reliability;
                    0.60-0.79 good reliability;
                    0.80-1.00 excellent reliability

臨床意義:良好再測信度的評估工具可確定測得之改變主要來自受測者本身的改變,而非測量誤差。

Model paper: Chen, K. L., Hsieh, C. L., Sheu, C. F., Hu, F. C., & Tseng, M. H. (2009). Reliability and validity of a Chinese version of the Pediatric Evaluation of Disability Inventory in children with cerebral palsy. J Rehabil Med, 41(4), 273-278. doi: 10.2340/16501977-0319

研究設計:評估同一群受測者二次以上,驗證於不同時間點所測量結果之一致性。

再測信度和測量誤差之關聯:再測信度是驗證於多次測量後,測量結果之穩定性。穩定性越高,測量結果的變化越小,測量結果一致性越高,表示測量受到誤差的影響越小。

2011年10月20日 星期四

表面效度(face validity) 20120131更新

Face validity means the the targeted population of  the measurement determines the measurement  relevant to the wanted construct and the wording written clearly.


中文解釋:表面效度意為評估工具的施用對象認為評估工具的內容與所欲測量的主題相關,而且用字清楚易懂。


臨床意義:
1.使受測者認為自己確實是接受特定主題的評估工具的評估,以提升其受測的配合度,確保填答結果之可信度。

2.使受測者正確理解評估工具的說明,以提升評估結果之可信度。


研究設計
邀請問卷之適用對象進行訪談或問卷調查,以瞭解問卷之適用對象認為此問卷內容是否與所欲測量概念相關、用字是否明確易懂。研究者參考適用對象之訪談內容與問卷調查結果,修改、刪除、新增評估工具的內容後,再次邀請適用對象提供評估工具內容之修改意見。最後,以問卷調查適用對象認為評估工具的內容相關程度以及措辭的清楚易懂程度(可用二分法或李克氏量尺)。相關程度與清楚易懂程度越高,表示表面效度越佳。



參考資料:
Broder, H. L., McGrath, C., & Cisneros, G. J. (2007). Questionnaire development: Face validity and item impact testing of the Child Oral Health Impact Profile. Community Dentistry and Oral Epidemiology, 35, 8-19.

Mosier, C. I. (1947). A critical examination of the concepts of face validity. Educational and Psychological Measurement, 7, 191-205.

Stineman, M. G., Ross, R. N., Fiedler, R., Granger, C. V., & Maislin, G. (2003). Functional independence staging: Conceptual foundation, face validity, and empirical derivation. 
Archives of Physical Medicine and Rehabilitation, 84, 29-37.

2011年10月19日 星期三

Reliable change index modified for practice

The reliable change index modified for practice (RCIp): This method provides criteria for meaningful change based on the calculated measurement error for each score.

Reference:Raymond, P. D., Hinton-Bayre, A. D., Radel, M., Ray, M. J., & Marsh, N. A. (2006). Assessment of statistical change criteria used to define significant change in neuropsychological test performance following cardiac surgery. Eur J Cardiothorac Surg, 29, 82-88.

中文解釋:練習效應之可信改變指數,計算測量誤差,提供標準以了解受測者的變化是否具有意義。

判斷標準:計算RCIp之90%信賴區間(confidence interval, CI),超過此信賴區間即表示受測者分數在統計上有顯著的進步或退步。

臨床意義:在考量練習效應後,受測者的分數變化是否在統計上達到顯著的改變(進不或退步)。

Model paper: Koh, C.-L., Lu, W.-S., Chen, H.-C., Hsueh, I.-P., Hsieh, J.-J., & Hsieh, C.-L. (2011). Test-retest reliability and practice effect of the Oral-format Symbol Digit Modalities Test in patients with stroke. Arch Clin Neuropsychol, 26(4), 356-363.

研究設計:評估同一群受測者二次以上,使用公示計算90% CI RCIp值。公式如下

90% CI RCIp = mean practice effect ± 1.645 × SEdiff,                                [1]
SEdiff = √2(SEM)2,                                                                                         [2]
SEM = √1-rxx,                                                                                                 [3]

最小可偵測變化值

The minimal detectable change (MDC) “is considered the minimal amount of change that is not likely to be due to chance variation or random measurement error in measurement,” and can be calculated to varying degrees of confidence (usually 95%).

Haley SM, Fragala-Pinkham MA. Interpreting change scores of tests and measures used in physical therapy. Phys Ther. 2006; 86: 735-743.

中文解釋: 最小可偵測變化值是指個案評估數據之差異值(個案經過特定治療後,前後評量所得數據之差異),此差異值是個案真正的改變,而非測量誤差所導致的。MDC通常以 95% 之信心水準為判定標準。

判斷標準: 若評估工具的MDC相對於評估工具總分低於10%,可視為良好可接受的評估誤差。
Flansbjer UB, Holmback AAM, Downham D, Patten C, Lexell J. Reliability of gait performance tests in men and women with hemiparesis after stroke. J Rehabil Med 2005; 37; 75-82.

研究設計: 同一個案接受同一或不同施測者評估至少2次評估(例如:前測及後測)


臨床上,治療師可以依據2次評估數據之差異來判斷,個案分數改變的大小若 ≥ MDC值,治療師具備95%之信心水準判定個案產生真正的進步,而非測量誤差所導致