Scraping Cochrane Reviews, some trials


CAUTION

This is beta stuff! Use with care.


1 Load packages

library(tidyverse)  # data wrangling
library(rvest)  # web scraping
library(xml2)  # web scraping
library(stringr)  # string manipulation
library(printr)  # print dfs as tables

The Cochrane Library (ISSN 1465-1858) is a collection of databases that contain different types of high-quality, independent evidence to inform healthcare decision-making.

Source: Cochrane

For that reason the site holds a treasure of information for health scientists. Let’s try to use some automatic tools to retrieve that information, ie., let’s do some web scraping.

2 Parse one review

Let’s read one page of the Cochrane Library, a page on which a review is published. For no particular reason, just as a showcase, let’s pickthis review:

cochrane_url <- "https://www.cochranelibrary.com/cdsr/doi/10.1002/14651858.CD012791.pub2/full"

Let’s parse the page:

page_content <- read_html(cochrane_url)  

Let’s have a glimpse at the parsed page:

str(page_content)
#> List of 2
#>  $ node:<externalptr> 
#>  $ doc :<externalptr> 
#>  - attr(*, "class")= chr [1:2] "xml_document" "xml_node"

3 Parse the title

title_publication <- page_content %>% 
  html_nodes(".publication-title") %>% 
  html_text()

Mindfulness‐based stress reduction for family carers of people with dementia

4 Parse the abstract

abstract <- page_content %>%
  html_node("body") %>% 
  xml_find_all("//section[contains(@class, 'abstract')]") %>% 
  rvest::html_text()

Abstract available in English Español Français 简体中文 Background Caring for people with dementia is highly challenging, and family carers are recognised as being at increased risk of physical and mental ill‐health. Most current interventions have limited success in reducing stress among carers of people with dementia. Mindfulness‐based stress reduction (MBSR) draws on a range of practices and may be a promising approach to helping carers of people with dementia. Objectives To assess the effectiveness of MBSR in reducing the stress of family carers of people with dementia. Search methods We searched ALOIS ‐ the Cochrane Dementia and Cognitive Improvement Group’s Specialized Register, the Cochrane Central Register of Controlled Trials (CENTRAL) (all years to Issue 9 of 12, 2017), MEDLINE (Ovid SP 1950 to September 2017), Embase (Ovid SP 1974 to Sepetmber 2017), Web of Science (ISI Web of Science 1945 to September 2017), PsycINFO (Ovid SP 1806 to September 2017), CINAHL (all dates to September 2017), LILACS (all dates to September 2017), World Health Organization (WHO) International Clinical Trials Registry Platform (ICTRP), ClinicalTrials.gov, and Dissertation Abstracts International (DAI) up to 6 September 2017, with no language restrictions. Selection criteria Randomised controlled trials (RCTs) of MBSR for family carers of people with dementia. Data collection and analysis Two review authors independently screened references for inclusion criteria, extracted data, assessed the risk of bias of trials with the Cochrane ‘Risk of bias’ tool, and evaluated the quality of the evidence using the GRADE instrument. We contacted study authors for additional information, then conducted meta‐analyses, or reported results narratively in the case of insufficient data. We used standard methodological procedures expected by Cochrane. Main results We included five RCTs involving 201 carers assessing the effectiveness of MBSR. Controls used in included studies varied in structure and content. Mindfulness‐based stress reduction programmes were compared with either active controls (those matched for time and attention with MBSR, i.e. education, social support, or progressive muscle relaxation), or inactive controls (those not matched for time and attention with MBSR, i.e. self help education or respite care). One trial used both active and inactive comparisons with MBSR. All studies were at high risk of bias in terms of blinding of outcome assessment. Most studies provided no information about selective reporting, incomplete outcome data, or allocation concealment. 1. Compared with active controls, MBSR may reduce depressive symptoms of carers at the end of the intervention (3 trials, 135 participants; standardised mean difference (SMD) ‐0.63, 95% confidence interval (CI) ‐0.98 to ‐0.28; P<0.001; low‐quality evidence). We could not be certain of any effect on clinically significant depressive symptoms (very low‐quality evidence). Mindfulness‐based stress reduction compared with active control may decrease carer anxiety at the end of the intervention (1 trial, 78 participants; mean difference (MD) ‐7.50, 95% CI ‐13.11 to ‐1.89; P<0.001; low‐quality evidence) and may slightly increase carer burden (3 trials, 135 participants; SMD 0.24, 95% CI ‐0.11 to 0.58; P=0.18; low‐quality evidence), although both results were imprecise, and we could not exclude little or no effect. Due to the very low quality of the evidence, we could not be sure of any effect on carers’ coping style, nor could we determine whether carers were more or less likely to drop out of treatment. 2. Compared with inactive controls, MBSR showed no clear evidence of any effect on depressive symptoms (2 trials, 50 participants; MD ‐1.97, 95% CI ‐6.89 to 2.95; P=0.43; low‐quality evidence). We could not be certain of any effect on clinically significant depressive symptoms (very low‐quality evidence). In this comparison, MBSR may also reduce carer anxiety at the end of the intervention (1 trial, 33 participants; MD ‐7.27, 95% CI ‐14.92 to 0.38; P=0.06; low‐quality evidence), although we were unable to exclude little or no effect. Due to the very low quality of the evidence, we could not be certain of any effects of MBSR on carer burden, the use of positive coping strategies, or dropout rates. We found no studies that looked at quality of life of carers or care‐recipients, or institutionalisation. Only one included study reported on adverse events, noting a single adverse event related to yoga practices at home Authors’ conclusions After accounting for non‐specific effects of the intervention (i.e. comparing it with an active control), low‐quality evidence suggests that MBSR may reduce carers’ depressive symptoms and anxiety, at least in the short term. There are significant limitations to the evidence base on MBSR in this population. Our GRADE assessment of the evidence was low to very low quality. We downgraded the quality of the evidence primarily because of high risk of detection or performance bias, and imprecision. In conclusion, MBSR has the potential to meet some important needs of the carer, but more high‐quality studies in this field are needed to confirm its efficacy.

5 Segment the abstract

5.1 Background

background_raw <- str_extract(abstract, "Background.+Objectives")

background <- 
  background_raw %>% str_remove_all("Background|Objectives")

Caring for people with dementia is highly challenging, and family carers are recognised as being at increased risk of physical and mental ill‐health. Most current interventions have limited success in reducing stress among carers of people with dementia. Mindfulness‐based stress reduction (MBSR) draws on a range of practices and may be a promising approach to helping carers of people with dementia.

5.2 Objectives

objectives_raw <- str_extract(abstract, "Objectives.+Search methods")

objectives <- 
  objectives_raw %>% str_remove_all("Objectives|Search methods")
objectives
#> [1] " To assess the effectiveness of MBSR in reducing the stress of family carers of people with dementia.  "

To assess the effectiveness of MBSR in reducing the stress of family carers of people with dementia.

5.3 And so forth

… for the other structured parts of the abstract.

6 Summary of Findings table

It should be noted that there are at times more than one table of findings. Let’s be content with the first one.

6.1 Parse node of class ‘summaryOfFindings’

By looking around in the source code of the web page, I found the class summaryOfFindings. Sounds promising.

summaryOfFindings <- 
  page_content %>% 
  html_node("body") %>% 
  xml_find_all("//section[contains(@class, 'summaryOfFindings')]") %>% 
  html_text()

summaryOfFindings
#> [1] "  Summary of findings      Open in table viewerSummary of findings for the main comparison. MBSR compared to active control for family carers of people with dementia  MBSR (mindfulness‐based stress reduction) compared to active control for family carers of people with dementia    Patient or population: family carers of people with dementiaSetting: communityIntervention: MBSRComparison: active control    Outcomes    Anticipated absolute effects* (95% CI)    Relative effect (95% CI)    № of participants (studies)    Quality of the evidence (GRADE)    Comments    Risk with active control    Risk with MBSR    Depressive symptoms assessed with: CESD, POMS‐depression subscale Scale from 0 to 60 follow‐up: range 7 weeks to 8 weeks    ‐   SMD 0.63 SD lower (0.98 lower to 0.28 lower)    ‐   135 (3 RCTs)    ⊕⊕⊝⊝ LOW a b   Moderate effect size;c lower score represents lower depressive symptoms    Anxiety assessed with: STAI‐state anxiety subscale Scale from 20 to 80 follow‐up: 8 weeks    The mean anxiety was 47.4 score.   MD 7.5 score lower (13.11 lower to 1.89 lower)    ‐   78 (1 RCT)    ⊕⊕⊝⊝ LOW a b   Moderate effect size;c lower score represents lower level of anxiety   Carer burden assessed with: MBCBS‐subjective stress burden, RMBPC‐reaction, ZBI Scale from 0 to 100 follow‐up: range 7 weeks to 8 weeks    ‐   SMD 0.24 SD higher (0.11 lower to 0.58 higher)    ‐   135 (3 RCTs)    ⊕⊕⊝⊝ LOW a b   Small effect size;c lower score represents lower level of carer burden   Dropout rates follow‐up: range 7 weeks to 8 weeks    Study population   RR 1.39 (0.32 to 5.99)    166 (4 RCTs)    ⊕⊝⊝⊝ VERY LOW a d   Important effect sizee   88 per 1000   122 per 1000 (28 to 524)    *The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).CESD: Center for Epidemiological Studies Depression Scale; CI: confidence interval; MBCBS: Montgomery Borgatta Caregiver Burden Scale; MD: mean difference; POMS: Profile of Mood States: RCT: randomised controlled trial; RMBPC: Revised Memory and Behavior Problems Checklist; RR: risk ratio; SD: standard deviation; SMD: standardised mean difference; STAI: State‐Trait Anxiety Inventory; ZBI: Zarit Burden Interview    GRADE Working Group grades of evidenceHigh quality: We are very confident that the true effect lies close to that of the estimate of the effect.Moderate quality: We are moderately confident in the effect estimate: the true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different.Low quality: Our confidence in the effect estimate is limited: the true effect may be substantially different from the estimate of the effect.Very low quality: We have very little confidence in the effect estimate: the true effect is likely to be substantially different from the estimate of effect.     aWe downgraded the quality of evidence by one level due to serious concern about high risk of bias in blinding of participants and personnel.bAs suggested by Ryan 2016a, we downgraded the quality of evidence by one level due to serious concern about imprecision.cTo assess the magnitude of effect for continuous outcomes, we used the criteria suggested by Cohen 1988: 0.2 represents a small effect, 0.5 a moderate effect, and 0.8 a large effect.dAs suggested by Ryan 2016a, we downgraded the quality of evidence by two levels due to very serious concern about imprecision.eAs suggested by Ryan 2016b, we considered an RR < 0.75 or RR > 1.25 an important effect size for dichotomous outcomes.      Open in table viewerSummary of findings 2. MBSR compared to inactive control for family carers of people with dementia  MBSR (mindfulness‐based stress reduction) compared to inactive control for family carers of people with dementia    Patient or population: family carers of people with dementiaSetting: communityIntervention: MBSRComparison: inactive control    Outcomes    Anticipated absolute effects* (95% CI)    Relative effect (95% CI)    № of participants (studies)    Quality of the evidence (GRADE)    Comments    Risk with inactive control    Risk with MBSR    Depressive symptoms assessed with: CESD Scale from 0 to 60 follow‐up: range 7 weeks to 8 weeks    The mean depressive symptoms was 14.2 score.   MD 1.97 score lower (6.89 lower to 2.95 higher)    ‐   50 (2 RCTs)    ⊕⊕⊝⊝ LOW a b   Small effect size;c lower score represents lower depressive symptoms   Anxiety assessed with: STAI‐state anxiety Scale from 20 to 80 follow‐up: 8 weeks    The mean anxiety was 47.8 score.   MD 7.27 score lower (14.92 lower to 0.38 higher)    ‐   33 (1 RCT)    ⊕⊕⊝⊝ LOW a b   Moderate effect size;c lower score represents lower level of anxiety   Carer burden assessed with: RMBPC‐reaction Scale from 0 to 96 follow‐up: 7 weeks    The mean carer burden was 26.4 score.   MD 1.6 score lower (19.48 lower to 16.28 higher)    ‐   17 (1 RCT)    ⊕⊕⊝⊝ VERY LOW b d   Small effect size;c lower score represents lower level of carer burden   Dropout rates follow‐up: 7 weeks    Study population   RR 2.00 (0.21 to 18.69)    20 (1 RCT)    ⊕⊝⊝⊝ VERY LOW b d   Important effect sizee   100 per 1000   200 per 1000 (21 to 1000)    *The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).CESD: Center for Epidemiological Studies Depression Scale; CI: confidence interval; MD: mean difference; RCT: randomised controlled trial; RMBPC: Revised Memory and Behavior Problems Checklist; RR: risk ratio; STAI: State‐Trait Anxiety Inventory    GRADE Working Group grades of evidenceHigh quality: We are very confident that the true effect lies close to that of the estimate of the effect.Moderate quality: We are moderately confident in the effect estimate: the true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different.Low quality: Our confidence in the effect estimate is limited: the true effect may be substantially different from the estimate of the effect.Very low quality: We have very little confidence in the effect estimate: the true effect is likely to be substantially different from the estimate of effect.     aAs suggested by Ryan 2016a, we downgraded the quality of evidence by one level due to serious concern about imprecision.bWe downgraded the quality of evidence by one level due to serious concern about high risk of bias in blinding of participants and personnel.cTo assess the magnitude of effect for continuous outcomes, we used the criteria suggested by Cohen 1988: 0.2 represents a small effect, 0.5 a moderate effect, and 0.8 a large effect.dAs suggested by Ryan 2016a, we downgraded the quality of evidence by two levels due to very serious concern about imprecision.eAs suggested by Ryan 2016b, we considered an RR < 0.75 or RR > 1.25 an important effect size for dichotomous outcomes.     "
#> [2] "  Open in table viewerSummary of findings for the main comparison. MBSR compared to active control for family carers of people with dementia  MBSR (mindfulness‐based stress reduction) compared to active control for family carers of people with dementia    Patient or population: family carers of people with dementiaSetting: communityIntervention: MBSRComparison: active control    Outcomes    Anticipated absolute effects* (95% CI)    Relative effect (95% CI)    № of participants (studies)    Quality of the evidence (GRADE)    Comments    Risk with active control    Risk with MBSR    Depressive symptoms assessed with: CESD, POMS‐depression subscale Scale from 0 to 60 follow‐up: range 7 weeks to 8 weeks    ‐   SMD 0.63 SD lower (0.98 lower to 0.28 lower)    ‐   135 (3 RCTs)    ⊕⊕⊝⊝ LOW a b   Moderate effect size;c lower score represents lower depressive symptoms    Anxiety assessed with: STAI‐state anxiety subscale Scale from 20 to 80 follow‐up: 8 weeks    The mean anxiety was 47.4 score.   MD 7.5 score lower (13.11 lower to 1.89 lower)    ‐   78 (1 RCT)    ⊕⊕⊝⊝ LOW a b   Moderate effect size;c lower score represents lower level of anxiety   Carer burden assessed with: MBCBS‐subjective stress burden, RMBPC‐reaction, ZBI Scale from 0 to 100 follow‐up: range 7 weeks to 8 weeks    ‐   SMD 0.24 SD higher (0.11 lower to 0.58 higher)    ‐   135 (3 RCTs)    ⊕⊕⊝⊝ LOW a b   Small effect size;c lower score represents lower level of carer burden   Dropout rates follow‐up: range 7 weeks to 8 weeks    Study population   RR 1.39 (0.32 to 5.99)    166 (4 RCTs)    ⊕⊝⊝⊝ VERY LOW a d   Important effect sizee   88 per 1000   122 per 1000 (28 to 524)    *The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).CESD: Center for Epidemiological Studies Depression Scale; CI: confidence interval; MBCBS: Montgomery Borgatta Caregiver Burden Scale; MD: mean difference; POMS: Profile of Mood States: RCT: randomised controlled trial; RMBPC: Revised Memory and Behavior Problems Checklist; RR: risk ratio; SD: standard deviation; SMD: standardised mean difference; STAI: State‐Trait Anxiety Inventory; ZBI: Zarit Burden Interview    GRADE Working Group grades of evidenceHigh quality: We are very confident that the true effect lies close to that of the estimate of the effect.Moderate quality: We are moderately confident in the effect estimate: the true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different.Low quality: Our confidence in the effect estimate is limited: the true effect may be substantially different from the estimate of the effect.Very low quality: We have very little confidence in the effect estimate: the true effect is likely to be substantially different from the estimate of effect.     aWe downgraded the quality of evidence by one level due to serious concern about high risk of bias in blinding of participants and personnel.bAs suggested by Ryan 2016a, we downgraded the quality of evidence by one level due to serious concern about imprecision.cTo assess the magnitude of effect for continuous outcomes, we used the criteria suggested by Cohen 1988: 0.2 represents a small effect, 0.5 a moderate effect, and 0.8 a large effect.dAs suggested by Ryan 2016a, we downgraded the quality of evidence by two levels due to very serious concern about imprecision.eAs suggested by Ryan 2016b, we considered an RR < 0.75 or RR > 1.25 an important effect size for dichotomous outcomes.      Open in table viewerSummary of findings 2. MBSR compared to inactive control for family carers of people with dementia  MBSR (mindfulness‐based stress reduction) compared to inactive control for family carers of people with dementia    Patient or population: family carers of people with dementiaSetting: communityIntervention: MBSRComparison: inactive control    Outcomes    Anticipated absolute effects* (95% CI)    Relative effect (95% CI)    № of participants (studies)    Quality of the evidence (GRADE)    Comments    Risk with inactive control    Risk with MBSR    Depressive symptoms assessed with: CESD Scale from 0 to 60 follow‐up: range 7 weeks to 8 weeks    The mean depressive symptoms was 14.2 score.   MD 1.97 score lower (6.89 lower to 2.95 higher)    ‐   50 (2 RCTs)    ⊕⊕⊝⊝ LOW a b   Small effect size;c lower score represents lower depressive symptoms   Anxiety assessed with: STAI‐state anxiety Scale from 20 to 80 follow‐up: 8 weeks    The mean anxiety was 47.8 score.   MD 7.27 score lower (14.92 lower to 0.38 higher)    ‐   33 (1 RCT)    ⊕⊕⊝⊝ LOW a b   Moderate effect size;c lower score represents lower level of anxiety   Carer burden assessed with: RMBPC‐reaction Scale from 0 to 96 follow‐up: 7 weeks    The mean carer burden was 26.4 score.   MD 1.6 score lower (19.48 lower to 16.28 higher)    ‐   17 (1 RCT)    ⊕⊕⊝⊝ VERY LOW b d   Small effect size;c lower score represents lower level of carer burden   Dropout rates follow‐up: 7 weeks    Study population   RR 2.00 (0.21 to 18.69)    20 (1 RCT)    ⊕⊝⊝⊝ VERY LOW b d   Important effect sizee   100 per 1000   200 per 1000 (21 to 1000)    *The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).CESD: Center for Epidemiological Studies Depression Scale; CI: confidence interval; MD: mean difference; RCT: randomised controlled trial; RMBPC: Revised Memory and Behavior Problems Checklist; RR: risk ratio; STAI: State‐Trait Anxiety Inventory    GRADE Working Group grades of evidenceHigh quality: We are very confident that the true effect lies close to that of the estimate of the effect.Moderate quality: We are moderately confident in the effect estimate: the true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different.Low quality: Our confidence in the effect estimate is limited: the true effect may be substantially different from the estimate of the effect.Very low quality: We have very little confidence in the effect estimate: the true effect is likely to be substantially different from the estimate of effect.     aAs suggested by Ryan 2016a, we downgraded the quality of evidence by one level due to serious concern about imprecision.bWe downgraded the quality of evidence by one level due to serious concern about high risk of bias in blinding of participants and personnel.cTo assess the magnitude of effect for continuous outcomes, we used the criteria suggested by Cohen 1988: 0.2 represents a small effect, 0.5 a moderate effect, and 0.8 a large effect.dAs suggested by Ryan 2016a, we downgraded the quality of evidence by two levels due to very serious concern about imprecision.eAs suggested by Ryan 2016b, we considered an RR < 0.75 or RR > 1.25 an important effect size for dichotomous outcomes.     "

So we have found the summary of findings, but it’s not a table, it’s rather a hot mess. Let’s try to make sense of it.

6.2 Table by ID

Looking further in the source code of the site, I realized that the tables have a unique ID, let’s address the table by its ID.

page_content %>% 
  html_nodes("#CD012791-tbl-0001") %>% 
  html_text()
#> [1] " Open in table viewerSummary of findings for the main comparison. MBSR compared to active control for family carers of people with dementia  MBSR (mindfulness‐based stress reduction) compared to active control for family carers of people with dementia    Patient or population: family carers of people with dementiaSetting: communityIntervention: MBSRComparison: active control    Outcomes    Anticipated absolute effects* (95% CI)    Relative effect (95% CI)    № of participants (studies)    Quality of the evidence (GRADE)    Comments    Risk with active control    Risk with MBSR    Depressive symptoms assessed with: CESD, POMS‐depression subscale Scale from 0 to 60 follow‐up: range 7 weeks to 8 weeks    ‐   SMD 0.63 SD lower (0.98 lower to 0.28 lower)    ‐   135 (3 RCTs)    ⊕⊕⊝⊝ LOW a b   Moderate effect size;c lower score represents lower depressive symptoms    Anxiety assessed with: STAI‐state anxiety subscale Scale from 20 to 80 follow‐up: 8 weeks    The mean anxiety was 47.4 score.   MD 7.5 score lower (13.11 lower to 1.89 lower)    ‐   78 (1 RCT)    ⊕⊕⊝⊝ LOW a b   Moderate effect size;c lower score represents lower level of anxiety   Carer burden assessed with: MBCBS‐subjective stress burden, RMBPC‐reaction, ZBI Scale from 0 to 100 follow‐up: range 7 weeks to 8 weeks    ‐   SMD 0.24 SD higher (0.11 lower to 0.58 higher)    ‐   135 (3 RCTs)    ⊕⊕⊝⊝ LOW a b   Small effect size;c lower score represents lower level of carer burden   Dropout rates follow‐up: range 7 weeks to 8 weeks    Study population   RR 1.39 (0.32 to 5.99)    166 (4 RCTs)    ⊕⊝⊝⊝ VERY LOW a d   Important effect sizee   88 per 1000   122 per 1000 (28 to 524)    *The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).CESD: Center for Epidemiological Studies Depression Scale; CI: confidence interval; MBCBS: Montgomery Borgatta Caregiver Burden Scale; MD: mean difference; POMS: Profile of Mood States: RCT: randomised controlled trial; RMBPC: Revised Memory and Behavior Problems Checklist; RR: risk ratio; SD: standard deviation; SMD: standardised mean difference; STAI: State‐Trait Anxiety Inventory; ZBI: Zarit Burden Interview    GRADE Working Group grades of evidenceHigh quality: We are very confident that the true effect lies close to that of the estimate of the effect.Moderate quality: We are moderately confident in the effect estimate: the true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different.Low quality: Our confidence in the effect estimate is limited: the true effect may be substantially different from the estimate of the effect.Very low quality: We have very little confidence in the effect estimate: the true effect is likely to be substantially different from the estimate of effect.     aWe downgraded the quality of evidence by one level due to serious concern about high risk of bias in blinding of participants and personnel.bAs suggested by Ryan 2016a, we downgraded the quality of evidence by one level due to serious concern about imprecision.cTo assess the magnitude of effect for continuous outcomes, we used the criteria suggested by Cohen 1988: 0.2 represents a small effect, 0.5 a moderate effect, and 0.8 a large effect.dAs suggested by Ryan 2016a, we downgraded the quality of evidence by two levels due to very serious concern about imprecision.eAs suggested by Ryan 2016b, we considered an RR < 0.75 or RR > 1.25 an important effect size for dichotomous outcomes.    "

Not really better.

6.3 Looking for tables

Let’s search explicitly for tables.

page_content %>% 
  html_nodes("table")
#> {xml_nodeset (9)}
#> [1] <table class="summary-of-findings framed">\n<a href="#0" class="open-tabl ...
#> [2] <table class="summary-of-findings framed">\n<a href="#0" class="open-tabl ...
#> [3] <table data-id="CD012791-tbl-0001" class="summary-of-findings framed">\n< ...
#> [4] <table data-id="CD012791-tbl-0002" class="summary-of-findings framed">\n< ...
#> [5] <table data-id="CD012791-tbl-0003" class="comparison">\n<div class="table ...
#> [6] <table data-id="CD012791-tbl-0004" class="comparison">\n<div class="table ...
#> [7] <table data-id="CD012791-tbl-0005" class="comparison">\n<div class="table ...
#> [8] <table data-id="CD012791-tbl-0006" class="comparison">\n<div class="table ...
#> [9] <table data-id="CD012791-tbl-0007" class="comparison">\n<div class="table ...

Looks promising. Let’s have a look at the first table.

summaryOfFindingsTable <- 
page_content %>% 
  html_nodes("table") %>% 
  .[[1]] %>% 
  html_table(fill = TRUE)

summaryOfFindingsTable
X1 X2 X3 X4 X5 X6 X7
MBSR (mindfulness‐based stress reduction) compared to active control for family carers of people with dementia MBSR (mindfulness‐based stress reduction) compared to active control for family carers of people with dementia MBSR (mindfulness‐based stress reduction) compared to active control for family carers of people with dementia MBSR (mindfulness‐based stress reduction) compared to active control for family carers of people with dementia MBSR (mindfulness‐based stress reduction) compared to active control for family carers of people with dementia MBSR (mindfulness‐based stress reduction) compared to active control for family carers of people with dementia MBSR (mindfulness‐based stress reduction) compared to active control for family carers of people with dementia
Patient or population: family carers of people with dementiaSetting: communityIntervention: MBSRComparison: active control Patient or population: family carers of people with dementiaSetting: communityIntervention: MBSRComparison: active control Patient or population: family carers of people with dementiaSetting: communityIntervention: MBSRComparison: active control Patient or population: family carers of people with dementiaSetting: communityIntervention: MBSRComparison: active control Patient or population: family carers of people with dementiaSetting: communityIntervention: MBSRComparison: active control Patient or population: family carers of people with dementiaSetting: communityIntervention: MBSRComparison: active control Patient or population: family carers of people with dementiaSetting: communityIntervention: MBSRComparison: active control
Outcomes Anticipated absolute effects* (95% CI) Anticipated absolute effects* (95% CI) Relative effect (95% CI) № of participants (studies) Quality of the evidence (GRADE) Comments
Outcomes Risk with active control Anticipated absolute effects* (95% CI) Relative effect (95% CI) № of participants (studies) Quality of the evidence (GRADE) Risk with MBSR
Depressive symptoms assessed with: CESD, POMS‐depression subscale Scale from 0 to 60 follow‐up: range 7 weeks to 8 weeks SMD 0.63 SD lower (0.98 lower to 0.28 lower) 135 (3 RCTs) ⊕⊕⊝⊝ LOW a b Moderate effect size;c lower score represents lower depressive symptoms
Anxiety assessed with: STAI‐state anxiety subscale Scale from 20 to 80 follow‐up: 8 weeks The mean anxiety was 47.4 score. MD 7.5 score lower (13.11 lower to 1.89 lower) 78 (1 RCT) ⊕⊕⊝⊝ LOW a b Moderate effect size;c lower score represents lower level of anxiety
Carer burden assessed with: MBCBS‐subjective stress burden, RMBPC‐reaction, ZBI Scale from 0 to 100 follow‐up: range 7 weeks to 8 weeks SMD 0.24 SD higher (0.11 lower to 0.58 higher) 135 (3 RCTs) ⊕⊕⊝⊝ LOW a b Small effect size;c lower score represents lower level of carer burden
Dropout rates follow‐up: range 7 weeks to 8 weeks Study population Study population RR 1.39 (0.32 to 5.99) 166 (4 RCTs) ⊕⊝⊝⊝ VERY LOW a d Important effect sizee
Dropout rates follow‐up: range 7 weeks to 8 weeks 88 per 1000 Study population RR 1.39 (0.32 to 5.99) 166 (4 RCTs) ⊕⊝⊝⊝ VERY LOW a d 122 per 1000 (28 to 524)
*The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).CESD: Center for Epidemiological Studies Depression Scale; CI: confidence interval; MBCBS: Montgomery Borgatta Caregiver Burden Scale; MD: mean difference; POMS: Profile of Mood States: RCT: randomised controlled trial; RMBPC: Revised Memory and Behavior Problems Checklist; RR: risk ratio; SD: standard deviation; SMD: standardised mean difference; STAI: State‐Trait Anxiety Inventory; ZBI: Zarit Burden Interview *The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).CESD: Center for Epidemiological Studies Depression Scale; CI: confidence interval; MBCBS: Montgomery Borgatta Caregiver Burden Scale; MD: mean difference; POMS: Profile of Mood States: RCT: randomised controlled trial; RMBPC: Revised Memory and Behavior Problems Checklist; RR: risk ratio; SD: standard deviation; SMD: standardised mean difference; STAI: State‐Trait Anxiety Inventory; ZBI: Zarit Burden Interview *The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).CESD: Center for Epidemiological Studies Depression Scale; CI: confidence interval; MBCBS: Montgomery Borgatta Caregiver Burden Scale; MD: mean difference; POMS: Profile of Mood States: RCT: randomised controlled trial; RMBPC: Revised Memory and Behavior Problems Checklist; RR: risk ratio; SD: standard deviation; SMD: standardised mean difference; STAI: State‐Trait Anxiety Inventory; ZBI: Zarit Burden Interview *The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).CESD: Center for Epidemiological Studies Depression Scale; CI: confidence interval; MBCBS: Montgomery Borgatta Caregiver Burden Scale; MD: mean difference; POMS: Profile of Mood States: RCT: randomised controlled trial; RMBPC: Revised Memory and Behavior Problems Checklist; RR: risk ratio; SD: standard deviation; SMD: standardised mean difference; STAI: State‐Trait Anxiety Inventory; ZBI: Zarit Burden Interview *The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).CESD: Center for Epidemiological Studies Depression Scale; CI: confidence interval; MBCBS: Montgomery Borgatta Caregiver Burden Scale; MD: mean difference; POMS: Profile of Mood States: RCT: randomised controlled trial; RMBPC: Revised Memory and Behavior Problems Checklist; RR: risk ratio; SD: standard deviation; SMD: standardised mean difference; STAI: State‐Trait Anxiety Inventory; ZBI: Zarit Burden Interview *The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).CESD: Center for Epidemiological Studies Depression Scale; CI: confidence interval; MBCBS: Montgomery Borgatta Caregiver Burden Scale; MD: mean difference; POMS: Profile of Mood States: RCT: randomised controlled trial; RMBPC: Revised Memory and Behavior Problems Checklist; RR: risk ratio; SD: standard deviation; SMD: standardised mean difference; STAI: State‐Trait Anxiety Inventory; ZBI: Zarit Burden Interview *The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).CESD: Center for Epidemiological Studies Depression Scale; CI: confidence interval; MBCBS: Montgomery Borgatta Caregiver Burden Scale; MD: mean difference; POMS: Profile of Mood States: RCT: randomised controlled trial; RMBPC: Revised Memory and Behavior Problems Checklist; RR: risk ratio; SD: standard deviation; SMD: standardised mean difference; STAI: State‐Trait Anxiety Inventory; ZBI: Zarit Burden Interview
GRADE Working Group grades of evidenceHigh quality: We are very confident that the true effect lies close to that of the estimate of the effect.Moderate quality: We are moderately confident in the effect estimate: the true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different.Low quality: Our confidence in the effect estimate is limited: the true effect may be substantially different from the estimate of the effect.Very low quality: We have very little confidence in the effect estimate: the true effect is likely to be substantially different from the estimate of effect. GRADE Working Group grades of evidenceHigh quality: We are very confident that the true effect lies close to that of the estimate of the effect.Moderate quality: We are moderately confident in the effect estimate: the true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different.Low quality: Our confidence in the effect estimate is limited: the true effect may be substantially different from the estimate of the effect.Very low quality: We have very little confidence in the effect estimate: the true effect is likely to be substantially different from the estimate of effect. GRADE Working Group grades of evidenceHigh quality: We are very confident that the true effect lies close to that of the estimate of the effect.Moderate quality: We are moderately confident in the effect estimate: the true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different.Low quality: Our confidence in the effect estimate is limited: the true effect may be substantially different from the estimate of the effect.Very low quality: We have very little confidence in the effect estimate: the true effect is likely to be substantially different from the estimate of effect. GRADE Working Group grades of evidenceHigh quality: We are very confident that the true effect lies close to that of the estimate of the effect.Moderate quality: We are moderately confident in the effect estimate: the true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different.Low quality: Our confidence in the effect estimate is limited: the true effect may be substantially different from the estimate of the effect.Very low quality: We have very little confidence in the effect estimate: the true effect is likely to be substantially different from the estimate of effect. GRADE Working Group grades of evidenceHigh quality: We are very confident that the true effect lies close to that of the estimate of the effect.Moderate quality: We are moderately confident in the effect estimate: the true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different.Low quality: Our confidence in the effect estimate is limited: the true effect may be substantially different from the estimate of the effect.Very low quality: We have very little confidence in the effect estimate: the true effect is likely to be substantially different from the estimate of effect. GRADE Working Group grades of evidenceHigh quality: We are very confident that the true effect lies close to that of the estimate of the effect.Moderate quality: We are moderately confident in the effect estimate: the true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different.Low quality: Our confidence in the effect estimate is limited: the true effect may be substantially different from the estimate of the effect.Very low quality: We have very little confidence in the effect estimate: the true effect is likely to be substantially different from the estimate of effect. GRADE Working Group grades of evidenceHigh quality: We are very confident that the true effect lies close to that of the estimate of the effect.Moderate quality: We are moderately confident in the effect estimate: the true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different.Low quality: Our confidence in the effect estimate is limited: the true effect may be substantially different from the estimate of the effect.Very low quality: We have very little confidence in the effect estimate: the true effect is likely to be substantially different from the estimate of effect.
aWe downgraded the quality of evidence by one level due to serious concern about high risk of bias in blinding of participants and personnel.bAs suggested by Ryan 2016a, we downgraded the quality of evidence by one level due to serious concern about imprecision.cTo assess the magnitude of effect for continuous outcomes, we used the criteria suggested by Cohen 1988: 0.2 represents a small effect, 0.5 a moderate effect, and 0.8 a large effect.dAs suggested by Ryan 2016a, we downgraded the quality of evidence by two levels due to very serious concern about imprecision.eAs suggested by Ryan 2016b, we considered an RR < 0.75 or RR > 1.25 an important effect size for dichotomous outcomes. aWe downgraded the quality of evidence by one level due to serious concern about high risk of bias in blinding of participants and personnel.bAs suggested by Ryan 2016a, we downgraded the quality of evidence by one level due to serious concern about imprecision.cTo assess the magnitude of effect for continuous outcomes, we used the criteria suggested by Cohen 1988: 0.2 represents a small effect, 0.5 a moderate effect, and 0.8 a large effect.dAs suggested by Ryan 2016a, we downgraded the quality of evidence by two levels due to very serious concern about imprecision.eAs suggested by Ryan 2016b, we considered an RR < 0.75 or RR > 1.25 an important effect size for dichotomous outcomes. aWe downgraded the quality of evidence by one level due to serious concern about high risk of bias in blinding of participants and personnel.bAs suggested by Ryan 2016a, we downgraded the quality of evidence by one level due to serious concern about imprecision.cTo assess the magnitude of effect for continuous outcomes, we used the criteria suggested by Cohen 1988: 0.2 represents a small effect, 0.5 a moderate effect, and 0.8 a large effect.dAs suggested by Ryan 2016a, we downgraded the quality of evidence by two levels due to very serious concern about imprecision.eAs suggested by Ryan 2016b, we considered an RR < 0.75 or RR > 1.25 an important effect size for dichotomous outcomes. aWe downgraded the quality of evidence by one level due to serious concern about high risk of bias in blinding of participants and personnel.bAs suggested by Ryan 2016a, we downgraded the quality of evidence by one level due to serious concern about imprecision.cTo assess the magnitude of effect for continuous outcomes, we used the criteria suggested by Cohen 1988: 0.2 represents a small effect, 0.5 a moderate effect, and 0.8 a large effect.dAs suggested by Ryan 2016a, we downgraded the quality of evidence by two levels due to very serious concern about imprecision.eAs suggested by Ryan 2016b, we considered an RR < 0.75 or RR > 1.25 an important effect size for dichotomous outcomes. aWe downgraded the quality of evidence by one level due to serious concern about high risk of bias in blinding of participants and personnel.bAs suggested by Ryan 2016a, we downgraded the quality of evidence by one level due to serious concern about imprecision.cTo assess the magnitude of effect for continuous outcomes, we used the criteria suggested by Cohen 1988: 0.2 represents a small effect, 0.5 a moderate effect, and 0.8 a large effect.dAs suggested by Ryan 2016a, we downgraded the quality of evidence by two levels due to very serious concern about imprecision.eAs suggested by Ryan 2016b, we considered an RR < 0.75 or RR > 1.25 an important effect size for dichotomous outcomes. aWe downgraded the quality of evidence by one level due to serious concern about high risk of bias in blinding of participants and personnel.bAs suggested by Ryan 2016a, we downgraded the quality of evidence by one level due to serious concern about imprecision.cTo assess the magnitude of effect for continuous outcomes, we used the criteria suggested by Cohen 1988: 0.2 represents a small effect, 0.5 a moderate effect, and 0.8 a large effect.dAs suggested by Ryan 2016a, we downgraded the quality of evidence by two levels due to very serious concern about imprecision.eAs suggested by Ryan 2016b, we considered an RR < 0.75 or RR > 1.25 an important effect size for dichotomous outcomes. aWe downgraded the quality of evidence by one level due to serious concern about high risk of bias in blinding of participants and personnel.bAs suggested by Ryan 2016a, we downgraded the quality of evidence by one level due to serious concern about imprecision.cTo assess the magnitude of effect for continuous outcomes, we used the criteria suggested by Cohen 1988: 0.2 represents a small effect, 0.5 a moderate effect, and 0.8 a large effect.dAs suggested by Ryan 2016a, we downgraded the quality of evidence by two levels due to very serious concern about imprecision.eAs suggested by Ryan 2016b, we considered an RR < 0.75 or RR > 1.25 an important effect size for dichotomous outcomes.
summaryOfFindingsTable <- 
  summaryOfFindingsTable %>% 
  mutate(id = row_number()) %>% 
  select(id, everything())

head(summaryOfFindingsTable)
id X1 X2 X3 X4 X5 X6 X7
1 MBSR (mindfulness‐based stress reduction) compared to active control for family carers of people with dementia MBSR (mindfulness‐based stress reduction) compared to active control for family carers of people with dementia MBSR (mindfulness‐based stress reduction) compared to active control for family carers of people with dementia MBSR (mindfulness‐based stress reduction) compared to active control for family carers of people with dementia MBSR (mindfulness‐based stress reduction) compared to active control for family carers of people with dementia MBSR (mindfulness‐based stress reduction) compared to active control for family carers of people with dementia MBSR (mindfulness‐based stress reduction) compared to active control for family carers of people with dementia
2 Patient or population: family carers of people with dementiaSetting: communityIntervention: MBSRComparison: active control Patient or population: family carers of people with dementiaSetting: communityIntervention: MBSRComparison: active control Patient or population: family carers of people with dementiaSetting: communityIntervention: MBSRComparison: active control Patient or population: family carers of people with dementiaSetting: communityIntervention: MBSRComparison: active control Patient or population: family carers of people with dementiaSetting: communityIntervention: MBSRComparison: active control Patient or population: family carers of people with dementiaSetting: communityIntervention: MBSRComparison: active control Patient or population: family carers of people with dementiaSetting: communityIntervention: MBSRComparison: active control
3 Outcomes Anticipated absolute effects* (95% CI) Anticipated absolute effects* (95% CI) Relative effect (95% CI) № of participants (studies) Quality of the evidence (GRADE) Comments
4 Outcomes Risk with active control Anticipated absolute effects* (95% CI) Relative effect (95% CI) № of participants (studies) Quality of the evidence (GRADE) Risk with MBSR
5 Depressive symptoms assessed with: CESD, POMS‐depression subscale Scale from 0 to 60 follow‐up: range 7 weeks to 8 weeks SMD 0.63 SD lower (0.98 lower to 0.28 lower) 135 (3 RCTs) ⊕⊕⊝⊝ LOW a b Moderate effect size;c lower score represents lower depressive symptoms
6 Anxiety assessed with: STAI‐state anxiety subscale Scale from 20 to 80 follow‐up: 8 weeks The mean anxiety was 47.4 score. MD 7.5 score lower (13.11 lower to 1.89 lower) 78 (1 RCT) ⊕⊕⊝⊝ LOW a b Moderate effect size;c lower score represents lower level of anxiety

Much better! The first two lines appear to be the table legend. The two lines with Outcomes are the read header line (split in two, as we actually have two header lines; compare with the source).

7 Extract (Primary) Outcomes with the GRADE

Very well then, let’s extract the outcomes with their respective GRADE.

7.1 Get column with outcomes

Find the column where the outcomes are stored and rename it to Outcomes.

col_Outcomes <- 
  summaryOfFindingsTable %>% 
  map(~ str_detect(., pattern = "Outcome")) %>% 
  map_lgl(~ any(. == TRUE)) %>% 
  which()

col_Outcomes
#> X1 
#>  2
names(summaryOfFindingsTable)[col_Outcomes] <- "Outcomes"

names(summaryOfFindingsTable)
#> [1] "id"       "Outcomes" "X2"       "X3"       "X4"       "X5"       "X6"      
#> [8] "X7"

7.2 Delete non-data rows

Also delete all rows that do not contain data.

To that end, find the highest row number of a row containing the word “Outcomes”.

max_row_to_delete <- 
  summaryOfFindingsTable %>% 
  filter(Outcomes == "Outcomes") %>% 
  pull(id) %>% 
  max()

max_row_to_delete
#> [1] 4
summaryOfFindingsTable2 <-
  summaryOfFindingsTable %>% 
  filter(id > max_row_to_delete)

summaryOfFindingsTable2
id Outcomes X2 X3 X4 X5 X6 X7
5 Depressive symptoms assessed with: CESD, POMS‐depression subscale Scale from 0 to 60 follow‐up: range 7 weeks to 8 weeks SMD 0.63 SD lower (0.98 lower to 0.28 lower) 135 (3 RCTs) ⊕⊕⊝⊝ LOW a b Moderate effect size;c lower score represents lower depressive symptoms
6 Anxiety assessed with: STAI‐state anxiety subscale Scale from 20 to 80 follow‐up: 8 weeks The mean anxiety was 47.4 score. MD 7.5 score lower (13.11 lower to 1.89 lower) 78 (1 RCT) ⊕⊕⊝⊝ LOW a b Moderate effect size;c lower score represents lower level of anxiety
7 Carer burden assessed with: MBCBS‐subjective stress burden, RMBPC‐reaction, ZBI Scale from 0 to 100 follow‐up: range 7 weeks to 8 weeks SMD 0.24 SD higher (0.11 lower to 0.58 higher) 135 (3 RCTs) ⊕⊕⊝⊝ LOW a b Small effect size;c lower score represents lower level of carer burden
8 Dropout rates follow‐up: range 7 weeks to 8 weeks Study population Study population RR 1.39 (0.32 to 5.99) 166 (4 RCTs) ⊕⊝⊝⊝ VERY LOW a d Important effect sizee
9 Dropout rates follow‐up: range 7 weeks to 8 weeks 88 per 1000 Study population RR 1.39 (0.32 to 5.99) 166 (4 RCTs) ⊕⊝⊝⊝ VERY LOW a d 122 per 1000 (28 to 524)
10 *The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).CESD: Center for Epidemiological Studies Depression Scale; CI: confidence interval; MBCBS: Montgomery Borgatta Caregiver Burden Scale; MD: mean difference; POMS: Profile of Mood States: RCT: randomised controlled trial; RMBPC: Revised Memory and Behavior Problems Checklist; RR: risk ratio; SD: standard deviation; SMD: standardised mean difference; STAI: State‐Trait Anxiety Inventory; ZBI: Zarit Burden Interview *The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).CESD: Center for Epidemiological Studies Depression Scale; CI: confidence interval; MBCBS: Montgomery Borgatta Caregiver Burden Scale; MD: mean difference; POMS: Profile of Mood States: RCT: randomised controlled trial; RMBPC: Revised Memory and Behavior Problems Checklist; RR: risk ratio; SD: standard deviation; SMD: standardised mean difference; STAI: State‐Trait Anxiety Inventory; ZBI: Zarit Burden Interview *The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).CESD: Center for Epidemiological Studies Depression Scale; CI: confidence interval; MBCBS: Montgomery Borgatta Caregiver Burden Scale; MD: mean difference; POMS: Profile of Mood States: RCT: randomised controlled trial; RMBPC: Revised Memory and Behavior Problems Checklist; RR: risk ratio; SD: standard deviation; SMD: standardised mean difference; STAI: State‐Trait Anxiety Inventory; ZBI: Zarit Burden Interview *The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).CESD: Center for Epidemiological Studies Depression Scale; CI: confidence interval; MBCBS: Montgomery Borgatta Caregiver Burden Scale; MD: mean difference; POMS: Profile of Mood States: RCT: randomised controlled trial; RMBPC: Revised Memory and Behavior Problems Checklist; RR: risk ratio; SD: standard deviation; SMD: standardised mean difference; STAI: State‐Trait Anxiety Inventory; ZBI: Zarit Burden Interview *The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).CESD: Center for Epidemiological Studies Depression Scale; CI: confidence interval; MBCBS: Montgomery Borgatta Caregiver Burden Scale; MD: mean difference; POMS: Profile of Mood States: RCT: randomised controlled trial; RMBPC: Revised Memory and Behavior Problems Checklist; RR: risk ratio; SD: standard deviation; SMD: standardised mean difference; STAI: State‐Trait Anxiety Inventory; ZBI: Zarit Burden Interview *The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).CESD: Center for Epidemiological Studies Depression Scale; CI: confidence interval; MBCBS: Montgomery Borgatta Caregiver Burden Scale; MD: mean difference; POMS: Profile of Mood States: RCT: randomised controlled trial; RMBPC: Revised Memory and Behavior Problems Checklist; RR: risk ratio; SD: standard deviation; SMD: standardised mean difference; STAI: State‐Trait Anxiety Inventory; ZBI: Zarit Burden Interview *The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).CESD: Center for Epidemiological Studies Depression Scale; CI: confidence interval; MBCBS: Montgomery Borgatta Caregiver Burden Scale; MD: mean difference; POMS: Profile of Mood States: RCT: randomised controlled trial; RMBPC: Revised Memory and Behavior Problems Checklist; RR: risk ratio; SD: standard deviation; SMD: standardised mean difference; STAI: State‐Trait Anxiety Inventory; ZBI: Zarit Burden Interview
11 GRADE Working Group grades of evidenceHigh quality: We are very confident that the true effect lies close to that of the estimate of the effect.Moderate quality: We are moderately confident in the effect estimate: the true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different.Low quality: Our confidence in the effect estimate is limited: the true effect may be substantially different from the estimate of the effect.Very low quality: We have very little confidence in the effect estimate: the true effect is likely to be substantially different from the estimate of effect. GRADE Working Group grades of evidenceHigh quality: We are very confident that the true effect lies close to that of the estimate of the effect.Moderate quality: We are moderately confident in the effect estimate: the true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different.Low quality: Our confidence in the effect estimate is limited: the true effect may be substantially different from the estimate of the effect.Very low quality: We have very little confidence in the effect estimate: the true effect is likely to be substantially different from the estimate of effect. GRADE Working Group grades of evidenceHigh quality: We are very confident that the true effect lies close to that of the estimate of the effect.Moderate quality: We are moderately confident in the effect estimate: the true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different.Low quality: Our confidence in the effect estimate is limited: the true effect may be substantially different from the estimate of the effect.Very low quality: We have very little confidence in the effect estimate: the true effect is likely to be substantially different from the estimate of effect. GRADE Working Group grades of evidenceHigh quality: We are very confident that the true effect lies close to that of the estimate of the effect.Moderate quality: We are moderately confident in the effect estimate: the true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different.Low quality: Our confidence in the effect estimate is limited: the true effect may be substantially different from the estimate of the effect.Very low quality: We have very little confidence in the effect estimate: the true effect is likely to be substantially different from the estimate of effect. GRADE Working Group grades of evidenceHigh quality: We are very confident that the true effect lies close to that of the estimate of the effect.Moderate quality: We are moderately confident in the effect estimate: the true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different.Low quality: Our confidence in the effect estimate is limited: the true effect may be substantially different from the estimate of the effect.Very low quality: We have very little confidence in the effect estimate: the true effect is likely to be substantially different from the estimate of effect. GRADE Working Group grades of evidenceHigh quality: We are very confident that the true effect lies close to that of the estimate of the effect.Moderate quality: We are moderately confident in the effect estimate: the true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different.Low quality: Our confidence in the effect estimate is limited: the true effect may be substantially different from the estimate of the effect.Very low quality: We have very little confidence in the effect estimate: the true effect is likely to be substantially different from the estimate of effect. GRADE Working Group grades of evidenceHigh quality: We are very confident that the true effect lies close to that of the estimate of the effect.Moderate quality: We are moderately confident in the effect estimate: the true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different.Low quality: Our confidence in the effect estimate is limited: the true effect may be substantially different from the estimate of the effect.Very low quality: We have very little confidence in the effect estimate: the true effect is likely to be substantially different from the estimate of effect.
12 aWe downgraded the quality of evidence by one level due to serious concern about high risk of bias in blinding of participants and personnel.bAs suggested by Ryan 2016a, we downgraded the quality of evidence by one level due to serious concern about imprecision.cTo assess the magnitude of effect for continuous outcomes, we used the criteria suggested by Cohen 1988: 0.2 represents a small effect, 0.5 a moderate effect, and 0.8 a large effect.dAs suggested by Ryan 2016a, we downgraded the quality of evidence by two levels due to very serious concern about imprecision.eAs suggested by Ryan 2016b, we considered an RR < 0.75 or RR > 1.25 an important effect size for dichotomous outcomes. aWe downgraded the quality of evidence by one level due to serious concern about high risk of bias in blinding of participants and personnel.bAs suggested by Ryan 2016a, we downgraded the quality of evidence by one level due to serious concern about imprecision.cTo assess the magnitude of effect for continuous outcomes, we used the criteria suggested by Cohen 1988: 0.2 represents a small effect, 0.5 a moderate effect, and 0.8 a large effect.dAs suggested by Ryan 2016a, we downgraded the quality of evidence by two levels due to very serious concern about imprecision.eAs suggested by Ryan 2016b, we considered an RR < 0.75 or RR > 1.25 an important effect size for dichotomous outcomes. aWe downgraded the quality of evidence by one level due to serious concern about high risk of bias in blinding of participants and personnel.bAs suggested by Ryan 2016a, we downgraded the quality of evidence by one level due to serious concern about imprecision.cTo assess the magnitude of effect for continuous outcomes, we used the criteria suggested by Cohen 1988: 0.2 represents a small effect, 0.5 a moderate effect, and 0.8 a large effect.dAs suggested by Ryan 2016a, we downgraded the quality of evidence by two levels due to very serious concern about imprecision.eAs suggested by Ryan 2016b, we considered an RR < 0.75 or RR > 1.25 an important effect size for dichotomous outcomes. aWe downgraded the quality of evidence by one level due to serious concern about high risk of bias in blinding of participants and personnel.bAs suggested by Ryan 2016a, we downgraded the quality of evidence by one level due to serious concern about imprecision.cTo assess the magnitude of effect for continuous outcomes, we used the criteria suggested by Cohen 1988: 0.2 represents a small effect, 0.5 a moderate effect, and 0.8 a large effect.dAs suggested by Ryan 2016a, we downgraded the quality of evidence by two levels due to very serious concern about imprecision.eAs suggested by Ryan 2016b, we considered an RR < 0.75 or RR > 1.25 an important effect size for dichotomous outcomes. aWe downgraded the quality of evidence by one level due to serious concern about high risk of bias in blinding of participants and personnel.bAs suggested by Ryan 2016a, we downgraded the quality of evidence by one level due to serious concern about imprecision.cTo assess the magnitude of effect for continuous outcomes, we used the criteria suggested by Cohen 1988: 0.2 represents a small effect, 0.5 a moderate effect, and 0.8 a large effect.dAs suggested by Ryan 2016a, we downgraded the quality of evidence by two levels due to very serious concern about imprecision.eAs suggested by Ryan 2016b, we considered an RR < 0.75 or RR > 1.25 an important effect size for dichotomous outcomes. aWe downgraded the quality of evidence by one level due to serious concern about high risk of bias in blinding of participants and personnel.bAs suggested by Ryan 2016a, we downgraded the quality of evidence by one level due to serious concern about imprecision.cTo assess the magnitude of effect for continuous outcomes, we used the criteria suggested by Cohen 1988: 0.2 represents a small effect, 0.5 a moderate effect, and 0.8 a large effect.dAs suggested by Ryan 2016a, we downgraded the quality of evidence by two levels due to very serious concern about imprecision.eAs suggested by Ryan 2016b, we considered an RR < 0.75 or RR > 1.25 an important effect size for dichotomous outcomes. aWe downgraded the quality of evidence by one level due to serious concern about high risk of bias in blinding of participants and personnel.bAs suggested by Ryan 2016a, we downgraded the quality of evidence by one level due to serious concern about imprecision.cTo assess the magnitude of effect for continuous outcomes, we used the criteria suggested by Cohen 1988: 0.2 represents a small effect, 0.5 a moderate effect, and 0.8 a large effect.dAs suggested by Ryan 2016a, we downgraded the quality of evidence by two levels due to very serious concern about imprecision.eAs suggested by Ryan 2016b, we considered an RR < 0.75 or RR > 1.25 an important effect size for dichotomous outcomes.

9 Type of Effect (effect statistic)

Let’s use the summary table to extract the type of effect, that is, the statistic used for reporting the effect size. Out of the front of my brain, I’d see the following statistics here:

  • SMD (standardized mean difference)
  • RR (relative risk)
  • OR (odds ratio)
  • MD (mean difference, ie., unstandardized)
  • Mean score
effect_statistic <- "SMD|RR|OR|MD|[M|m]ean.+[S|s]core"
effect_statistic_per_outcome <- 
  summaryOfFindingsTable4 %>% 
  map_dfc(~ tibble(col = str_extract(., pattern = effect_statistic))) %>% 
  pmap_chr(.f = paste) %>% 
  map_chr(~ str_remove_all(., pattern = "NA | NA"))
summaryOfFindingsTable5 <-
  summaryOfFindingsTable4 %>% 
  mutate(effect_statistic = effect_statistic_per_outcome)

10 Number of trials and participants per outcome

Let’s look in the header to see in which column we would find something like “№ of participants (studies)” (or some derivates of this string).

n_of_trials_string <- "of participants \\(studies\\)"
col_participants_studies <- 
  summaryOfFindingsTable %>% 
  map( ~ str_detect(., pattern = n_of_trials_string)) %>% 
  map_lgl( ~ any(. == TRUE)) %>% 
  keep(isTRUE) %>% 
  names()

col_participants_studies
#> [1] "X5"
summaryOfFindingsTable6 <-
  summaryOfFindingsTable5 %>% 
  rename(n_participants_studies := {col_participants_studies}) %>% 
  separate(col = n_participants_studies, sep = "\\(",
           into = c("n_participants", "n_studies"),
           remove = FALSE)


summaryOfFindingsTable6
id Outcomes X2 X3 X4 n_participants_studies n_participants n_studies GRADE X7 effect_statistic
5 Depressive symptoms assessed with: CESD, POMS‐depression subscale Scale from 0 to 60 follow‐up: range 7 weeks to 8 weeks SMD 0.63 SD lower (0.98 lower to 0.28 lower) 135 (3 RCTs) 135 3 RCTs) ⊕⊕⊝⊝ LOW a b Moderate effect size;c lower score represents lower depressive symptoms SMD
6 Anxiety assessed with: STAI‐state anxiety subscale Scale from 20 to 80 follow‐up: 8 weeks The mean anxiety was 47.4 score. MD 7.5 score lower (13.11 lower to 1.89 lower) 78 (1 RCT) 78 1 RCT) ⊕⊕⊝⊝ LOW a b Moderate effect size;c lower score represents lower level of anxiety mean anxiety was 47.4 score MD
7 Carer burden assessed with: MBCBS‐subjective stress burden, RMBPC‐reaction, ZBI Scale from 0 to 100 follow‐up: range 7 weeks to 8 weeks SMD 0.24 SD higher (0.11 lower to 0.58 higher) 135 (3 RCTs) 135 3 RCTs) ⊕⊕⊝⊝ LOW a b Small effect size;c lower score represents lower level of carer burden SMD
8 Dropout rates follow‐up: range 7 weeks to 8 weeks Study population Study population RR 1.39 (0.32 to 5.99) 166 (4 RCTs) 166 4 RCTs) ⊕⊝⊝⊝ VERY LOW a d Important effect sizee RR
9 Dropout rates follow‐up: range 7 weeks to 8 weeks 88 per 1000 Study population RR 1.39 (0.32 to 5.99) 166 (4 RCTs) 166 4 RCTs) ⊕⊝⊝⊝ VERY LOW a d 122 per 1000 (28 to 524) RR

11 Confidence interval for relative risk per outcome

rel_eff_CI_str <- "Relative effect \\(95% CI\\)"
col_rel_eff_CI <- 
  summaryOfFindingsTable %>% 
  map( ~ str_detect(., pattern = rel_eff_CI_str)) %>% 
  map_lgl( ~ any(. == TRUE)) %>% 
  keep(isTRUE) %>% 
  names()

col_rel_eff_CI
#> [1] "X4"
summaryOfFindingsTable7 <- 
  summaryOfFindingsTable6 %>% 
  rename(relative_effect_95CI := {col_rel_eff_CI}) %>% 
  mutate(CI_lower = str_extract(relative_effect_95CI,
                                "\\(\\d+\\.*\\d+")) %>% 
  mutate(CI_lower = str_remove(CI_lower, "\\(")) %>% 
  mutate(CI_upper = str_extract(relative_effect_95CI,
                                "\\d+\\.*\\d+\\)")) %>% 
  mutate(CI_upper = str_remove(CI_upper, "\\)"))

summaryOfFindingsTable7
id Outcomes X2 X3 relative_effect_95CI n_participants_studies n_participants n_studies GRADE X7 effect_statistic CI_lower CI_upper
5 Depressive symptoms assessed with: CESD, POMS‐depression subscale Scale from 0 to 60 follow‐up: range 7 weeks to 8 weeks SMD 0.63 SD lower (0.98 lower to 0.28 lower) 135 (3 RCTs) 135 3 RCTs) ⊕⊕⊝⊝ LOW a b Moderate effect size;c lower score represents lower depressive symptoms SMD NA NA
6 Anxiety assessed with: STAI‐state anxiety subscale Scale from 20 to 80 follow‐up: 8 weeks The mean anxiety was 47.4 score. MD 7.5 score lower (13.11 lower to 1.89 lower) 78 (1 RCT) 78 1 RCT) ⊕⊕⊝⊝ LOW a b Moderate effect size;c lower score represents lower level of anxiety mean anxiety was 47.4 score MD NA NA
7 Carer burden assessed with: MBCBS‐subjective stress burden, RMBPC‐reaction, ZBI Scale from 0 to 100 follow‐up: range 7 weeks to 8 weeks SMD 0.24 SD higher (0.11 lower to 0.58 higher) 135 (3 RCTs) 135 3 RCTs) ⊕⊕⊝⊝ LOW a b Small effect size;c lower score represents lower level of carer burden SMD NA NA
8 Dropout rates follow‐up: range 7 weeks to 8 weeks Study population Study population RR 1.39 (0.32 to 5.99) 166 (4 RCTs) 166 4 RCTs) ⊕⊝⊝⊝ VERY LOW a d Important effect sizee RR 0.32 5.99
9 Dropout rates follow‐up: range 7 weeks to 8 weeks 88 per 1000 Study population RR 1.39 (0.32 to 5.99) 166 (4 RCTs) 166 4 RCTs) ⊕⊝⊝⊝ VERY LOW a d 122 per 1000 (28 to 524) RR 0.32 5.99

12 Debrief

As of this quick and dirty analysis, it seems that relevant information can be scraped from the Cochrane Library website. However, this hinges on the condition that other reviews, that is, other sites at Cochrane Library are structured in the same way. To be determined.

In addition, it would be much more elegant to use the Cochrane API. However, so far I’ve found no way how to be granted access.

It might seem that the whole fuss is like shooting sparrows with canons: All the information parsed could have easily been copy-and-pasted. True. However assume you’d like to parse a great number of pages. In that case you’d be glad to have some machine doing the boring stuff for you.

To get it running for production, all the stuff here should go into some neat function, so that the scraping can be applies smoothly to more pages.

13 Reproducibility

#> ─ Session info ───────────────────────────────────────────────────────────────────────────────────────────────────────
#>  setting  value                       
#>  version  R version 4.0.2 (2020-06-22)
#>  os       macOS  10.16                
#>  system   x86_64, darwin17.0          
#>  ui       X11                         
#>  language (EN)                        
#>  collate  en_US.UTF-8                 
#>  ctype    en_US.UTF-8                 
#>  tz       Europe/Berlin               
#>  date     2021-02-19                  
#> 
#> ─ Packages ───────────────────────────────────────────────────────────────────────────────────────────────────────────
#>  package     * version     date       lib source                             
#>  assertthat    0.2.1       2019-03-21 [1] CRAN (R 4.0.0)                     
#>  backports     1.2.1       2020-12-09 [1] CRAN (R 4.0.2)                     
#>  blogdown      1.1         2021-01-19 [1] CRAN (R 4.0.2)                     
#>  bookdown      0.21.6      2021-02-02 [1] Github (rstudio/bookdown@6c7346a)  
#>  broom         0.7.4       2021-01-29 [1] CRAN (R 4.0.2)                     
#>  bslib         0.2.4.9000  2021-02-02 [1] Github (rstudio/bslib@b3cd7a9)     
#>  cachem        1.0.1       2021-01-21 [1] CRAN (R 4.0.2)                     
#>  callr         3.5.1       2020-10-13 [1] CRAN (R 4.0.2)                     
#>  cellranger    1.1.0       2016-07-27 [1] CRAN (R 4.0.0)                     
#>  cli           2.3.0       2021-01-31 [1] CRAN (R 4.0.2)                     
#>  codetools     0.2-16      2018-12-24 [2] CRAN (R 4.0.2)                     
#>  colorspace    2.0-0       2020-11-11 [1] CRAN (R 4.0.2)                     
#>  crayon        1.4.1       2021-02-08 [1] CRAN (R 4.0.2)                     
#>  curl          4.3         2019-12-02 [1] CRAN (R 4.0.0)                     
#>  DBI           1.1.1       2021-01-15 [1] CRAN (R 4.0.2)                     
#>  dbplyr        2.0.0       2020-11-03 [1] CRAN (R 4.0.2)                     
#>  desc          1.2.0       2018-05-01 [1] CRAN (R 4.0.0)                     
#>  devtools      2.3.2       2020-09-18 [1] CRAN (R 4.0.2)                     
#>  digest        0.6.27      2020-10-24 [1] CRAN (R 4.0.2)                     
#>  dplyr       * 1.0.3       2021-01-15 [1] CRAN (R 4.0.2)                     
#>  ellipsis      0.3.1       2020-05-15 [1] CRAN (R 4.0.0)                     
#>  evaluate      0.14        2019-05-28 [1] CRAN (R 4.0.0)                     
#>  fastmap       1.1.0       2021-01-25 [1] CRAN (R 4.0.2)                     
#>  forcats     * 0.5.1       2021-01-27 [1] CRAN (R 4.0.2)                     
#>  fs            1.5.0       2020-07-31 [1] CRAN (R 4.0.2)                     
#>  generics      0.1.0       2020-10-31 [1] CRAN (R 4.0.2)                     
#>  ggplot2     * 3.3.3       2020-12-30 [1] CRAN (R 4.0.2)                     
#>  glue          1.4.2       2020-08-27 [1] CRAN (R 4.0.2)                     
#>  gtable        0.3.0       2019-03-25 [1] CRAN (R 4.0.0)                     
#>  haven         2.3.1       2020-06-01 [1] CRAN (R 4.0.0)                     
#>  highr         0.8         2019-03-20 [1] CRAN (R 4.0.0)                     
#>  hms           1.0.0       2021-01-13 [1] CRAN (R 4.0.2)                     
#>  htmltools     0.5.1.1     2021-01-22 [1] CRAN (R 4.0.2)                     
#>  httr          1.4.2       2020-07-20 [1] CRAN (R 4.0.2)                     
#>  jquerylib     0.1.3       2020-12-17 [1] CRAN (R 4.0.2)                     
#>  jsonlite      1.7.2       2020-12-09 [1] CRAN (R 4.0.2)                     
#>  knitr         1.31        2021-01-27 [1] CRAN (R 4.0.2)                     
#>  lifecycle     0.2.0       2020-03-06 [1] CRAN (R 4.0.0)                     
#>  lubridate     1.7.9.2     2020-11-13 [1] CRAN (R 4.0.2)                     
#>  magrittr      2.0.1       2020-11-17 [1] CRAN (R 4.0.2)                     
#>  memoise       2.0.0       2021-01-26 [1] CRAN (R 4.0.2)                     
#>  modelr        0.1.8       2020-05-19 [1] CRAN (R 4.0.0)                     
#>  munsell       0.5.0       2018-06-12 [1] CRAN (R 4.0.0)                     
#>  pillar        1.4.7       2020-11-20 [1] CRAN (R 4.0.2)                     
#>  pkgbuild      1.2.0       2020-12-15 [1] CRAN (R 4.0.2)                     
#>  pkgconfig     2.0.3       2019-09-22 [1] CRAN (R 4.0.0)                     
#>  pkgload       1.1.0       2020-05-29 [1] CRAN (R 4.0.0)                     
#>  prettyunits   1.1.1       2020-01-24 [1] CRAN (R 4.0.0)                     
#>  printr      * 0.1.1       2021-01-27 [1] CRAN (R 4.0.2)                     
#>  processx      3.4.5       2020-11-30 [1] CRAN (R 4.0.2)                     
#>  ps            1.5.0       2020-12-05 [1] CRAN (R 4.0.2)                     
#>  purrr       * 0.3.4       2020-04-17 [1] CRAN (R 4.0.0)                     
#>  R6            2.5.0       2020-10-28 [1] CRAN (R 4.0.2)                     
#>  Rcpp          1.0.6       2021-01-15 [1] CRAN (R 4.0.2)                     
#>  readr       * 1.4.0       2020-10-05 [1] CRAN (R 4.0.2)                     
#>  readxl        1.3.1       2019-03-13 [1] CRAN (R 4.0.0)                     
#>  remotes       2.2.0       2020-07-21 [1] CRAN (R 4.0.2)                     
#>  reprex        1.0.0       2021-01-27 [1] CRAN (R 4.0.2)                     
#>  rlang         0.4.10      2020-12-30 [1] CRAN (R 4.0.2)                     
#>  rmarkdown     2.6.6       2021-02-11 [1] Github (rstudio/rmarkdown@a62cb20) 
#>  rprojroot     2.0.2       2020-11-15 [1] CRAN (R 4.0.2)                     
#>  rstudioapi    0.13.0-9000 2021-02-11 [1] Github (rstudio/rstudioapi@9d21f50)
#>  rvest       * 0.3.6       2020-07-25 [1] CRAN (R 4.0.2)                     
#>  sass          0.3.1       2021-01-24 [1] CRAN (R 4.0.2)                     
#>  scales        1.1.1       2020-05-11 [1] CRAN (R 4.0.0)                     
#>  selectr       0.4-2       2019-11-20 [1] CRAN (R 4.0.0)                     
#>  sessioninfo   1.1.1       2018-11-05 [1] CRAN (R 4.0.0)                     
#>  stringi       1.5.3       2020-09-09 [1] CRAN (R 4.0.2)                     
#>  stringr     * 1.4.0       2019-02-10 [1] CRAN (R 4.0.0)                     
#>  testthat      3.0.1       2020-12-17 [1] CRAN (R 4.0.2)                     
#>  tibble      * 3.0.6       2021-01-29 [1] CRAN (R 4.0.2)                     
#>  tidyr       * 1.1.2       2020-08-27 [1] CRAN (R 4.0.2)                     
#>  tidyselect    1.1.0       2020-05-11 [1] CRAN (R 4.0.0)                     
#>  tidyverse   * 1.3.0       2019-11-21 [1] CRAN (R 4.0.0)                     
#>  usethis       2.0.0       2020-12-10 [1] CRAN (R 4.0.2)                     
#>  vctrs         0.3.6       2020-12-17 [1] CRAN (R 4.0.2)                     
#>  withr         2.4.1       2021-01-26 [1] CRAN (R 4.0.2)                     
#>  xfun          0.21        2021-02-10 [1] CRAN (R 4.0.2)                     
#>  xml2        * 1.3.2       2020-04-23 [1] CRAN (R 4.0.0)                     
#>  yaml          2.2.1       2020-02-01 [1] CRAN (R 4.0.0)                     
#> 
#> [1] /Users/sebastiansaueruser/Rlibs
#> [2] /Library/Frameworks/R.framework/Versions/4.0/Resources/library