Patterns of antibiotic use, pathogens, and prediction of mortality in hospitalized neonates and young infants with sepsis: A global neonatal sepsis observational cohort study (NeoOBS)

0
120


Citation: Russell NJ, Stöhr W, Plakkal N, Cook A, Berkley JA, Adhisivam B, et al. (2023) Patterns of antibiotic use, pathogens, and prediction of mortality in hospitalized neonates and young infants with sepsis: A global neonatal sepsis observational cohort study (NeoOBS). PLoS Med 20(6):
e1004179.

https://doi.org/10.1371/journal.pmed.1004179

Received: June 15, 2022; Accepted: January 19, 2023; Published: June 8, 2023

Copyright: © 2023 Russell et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All relevant data are within the manuscript and its Supporting information files.

Funding: This study was funded by the Global Antibiotic Research and Development Partnership (GARDP), made possible by Bill & Melinda Gates Foundation; German Federal Ministry of Education and Research; German Federal Ministry of Health; Government of the Principality of Monaco; the Indian Council for Medical Research; Japanese Ministry of Health, Labour and Welfare; Netherlands Ministry of Health, Welfare and Sport; South African Medical Research Council; UK Department of Health and Social Care (UK National Institute of Health Research and the Global Antimicrobial Resistance Innovation Fund – GAMRIF); UK Medical Research Council; Wellcome Trust. GARDP has also received core funding from the Leo Model Foundation; Luxembourg Ministry of Development Cooperation and Humanitarian Aid; Luxembourg Ministry of Health; Médecins Sans Frontières; Swiss Federal Office of Public Health; UK Foreign, Commonwealth & Development Office (previously the UK Department for International Development). GARDP were consulted on study design. Members of the funder participated as authors on the study. All authors had the opportunity to access and verify the data, and all authors were responsible for the decision to submit the manuscript for publication.

Competing interests: I have read the journal’s policy and the authors of this manuscript have the following competing interests: Members of the funder (MB, NK, SE) participated as authors on the study, reviewed the manuscript, and approved the final manuscript as submitted. All other authors have declared that no competing interests exist.

Abbreviations:
AMR,
antimicrobial resistance; AUROC,
area under the receiver operator curve; CLSI,
Clinical and Laboratory Standard Institute; CRO,
carbapenem-resistant organism; CSF,
cerebrospinal fluid; EMLc,
Essential Medicine List for Children; EQA,
external quality assurance; ESBL,
extended-spectrum beta-lactamase; EUCAST,
European Committee on Antimicrobial Susceptibility Testing; FP,
fractional polynomial; HAI,
healthcare-associated infection; LMIC,
low- and middle-income country; MDR,
multidrug resistant; pSBI,
possible serious bacterial infection; SMART,
Sequential Multiple Assessment Randomized Trial; SOC,
standard of care; WHO,
World Health Organization

Introduction

Sepsis is responsible for a significant burden of disease in neonates and young infants, both as a primary cause of death and as a frequent contributor [1,2]. Access to facility-based delivery and care, including antibiotics, has not reduced mortality sufficiently to achieve the Sustainable Development Goal targets in many low- and middle-income countries (LMICs) [3]. Antimicrobial resistance (AMR) increasingly threatens to undermine the effectiveness of antibiotics and potentially slow progress in reducing mortality, particularly in LMICs [410], with AMR-attributable neonatal deaths recently estimated between 140,000 [10] and 214,000 annually [11].

Recent large-scale antibiotic trials in neonates and young infants in LMIC settings have largely focused on simplification of first-line antibiotic regimens with oral amoxicillin and short course gentamicin [12,13]. These have been based in primary healthcare settings and included populations with case fatality below 2%. An increasing global proportion of newborns are delivered in facilities [14], where sepsis case fatality rates and the burden of AMR are greater. Despite this, there is limited high-quality evidence generated from LMIC neonatal inpatient settings to guide empiric antibiotic treatment [4,15]. Published observational data largely involve single-center studies reporting non-systematically collected microbiological data, which are rarely accompanied by detailed clinical and antibiotic use data. Data regarding global antibiotic use often rely on point prevalence surveys, with limited information on patterns of switching and duration [16].

NeoOBS is a prospective multicountry observational study in which we collected detailed daily longitudinal data on clinical features, microbiology, antibiotic use, switching, and outcomes of neonatal sepsis in hospital settings, predominantly in LMICs. The objectives were to describe variation in clinical presentation and patterns of hospital-based antibiotic use and to develop 2 linked clinically based scores adapted from the World Health Organization (WHO) possible serious bacterial infection (pSBI) criteria, relevant to hospitalized neonates with sepsis in LMICs: (1) a sepsis severity score to predict 28-day mortality from factors known at sepsis presentation; and (2) a recovery score to predict the daily risk of death on treatment with IV antibiotics using daily updated assessments of clinical status. The aim of the study was to inform inclusion criteria, empiric and second-line treatment, and criteria for switching antibiotics in the design of hospital-based neonatal antibiotic trials [17].

Methods

Study design and participants

Hospitalized infants <60 days of age with a new episode of clinically suspected sepsis were enrolled between 2018 and 2020 in 19 hospital sites across 11 countries in Asia (Bangladesh, China, India, Thailand, and Vietnam), Africa (Kenya, South Africa, and Uganda), Europe (Italy and Greece), and South America (Brazil). Sites were selected after conducting a feasibility study [18] to represent diverse regions and to include secondary and tertiary referral hospitals, public facilities, and facilities with varying proportions of in-born and out-born infants, with access to microbiology.

Infants were eligible if the local physician had decided to treat the infant with antibiotics for a new episode of sepsis meeting the inclusion criteria (S1 Fig), derived by combining clinical and laboratory criteria from WHO pSBI [19] and EMA Criteria for neonatal sepsis trials [20]. To allow for variation in access to laboratory testing, and ensure generalizability to varying LMIC hospital contexts, laboratory values were not mandatory. A minimum of 2 clinical, or 1 clinical and 1 laboratory sepsis criteria, were required for inclusion, and up to 200 infants per site were enrolled according to a sampling frame adapted to local case volume and activity (see S1 Appendix). Infants were excluded if an alternative primary diagnosis other than sepsis was suspected, or a serious non-infective comorbidity was expected to cause death within 72 h. Previous antibiotic use was not an exclusion criterion as long as a new antibiotic regimen was being started after a blood culture for a distinct new episode of sepsis. Sepsis episodes occurring >48 h after admission, defined by time of blood culture, were defined as healthcare-associated infections (HAIs).

Ethical approval was obtained from St. George’s, University of London (SGUL) Research Ethics Committee and sites’ local, central or national ethics committees and other relevant local bodies, where required. This study is reported using (1) the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) guideline (S1 STROBE Checklist), with its design guided by the STROBE-NI framework [21]; and (2) the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) guidelines [22]. The study was registered with ClinicalTrials.gov (NCT03721302).

Microbiology/laboratory assessments

Laboratory analysis was performed in each site following local practice, with standard operating procedures developed to optimize procedures including blood culture technique, and antibiotic susceptibility testing. A locally defined algorithm was used to classify contaminants and pathogens by site clinicians and microbiologists. External validation of the capability of laboratories to detect multidrug resistant (MDR) gram-negative pathogens from each site was evaluated objectively by testing an external quality assurance (EQA) panel sent from the central laboratory at Laboratory of Medical Microbiology (LMM) at the University of Antwerp (UA).

European Committee on Antimicrobial Susceptibility Testing (EUCAST) or Clinical and Laboratory Standard Institute (CLSI) guidelines and interpretive algorithms were used to interpret reported antibiotic susceptibility testing data for all organisms. EUCAST 2019 breakpoints table (v9.0) and guidelines were used as this was when most data was reported. In most cases, where susceptibility to a particular antibiotic was reported, this was used to determine susceptibility. If the organism was considered intrinsically resistant to a particular antibiotic according to EUCAST, then it was coded as resistant regardless of reported susceptibility. If susceptibility to a particular antibiotic was not reported, susceptibility results of a different antibiotic in the same class were used to determine susceptibility (e.g., susceptibility of organism to another aminoglycoside if gentamicin susceptibility was not reported); if no other antibiotic in that class had reported susceptibility, then susceptibility was coded as unknown.

Analysis of antibiotic patterns of use

The initial antibiotic regimen was defined as the first new antibiotic(s) started within 24 h from baseline blood culture (including 3 h pre baseline culture). To structure the analysis and reporting of the multiple regimens used, a novel method of grouping antibiotics was derived, based on the Essential Medicine List for Children (EMLc) AWaRe classification (Access, Watch, Reserve) [24], with the “Watch” category divided into 3 distinct groups of “Low/Medium/High Watch” based on inclusion in current WHO guidelines (Low Watch) and likelihood of resistance generation with regimens outside WHO recommendations (Medium or High Watch) [25]. Antibiotic groups were defined by the main “stem” in the antibiotic combination: Group 1 antibiotics included a first-line WHO-recommended penicillin-based regimen (e.g., ampicillin and gentamicin) (Access), Group 2 included third-generation cephalosporin (e.g., cefotaxime/ceftriaxone)-based WHO regimens (“Low” Watch), Group 3 included regimens with partial anti-extended-spectrum beta-lactamase (ESBL) or pseudomonal activity (e.g., piperacillin-tazobactam/ceftazidime/fluoroquinolone-based) (“Medium” Watch), and Group 4 included carbapenems (e.g., meropenem) (“High” Watch). Group 5 antibiotics included Reserve antibiotics targeting carbapenem-resistant organisms (CROs) (e.g., colistin). Aminoglycosides (e.g., gentamicin/amikacin), glycopeptides (e.g., vancomycin/teicoplanin), and metronidazole used in combination regimens were classified as additional coverage and did not define the main antibiotic “stem” for the grouping. All antifungals and antivirals were excluded from the antimicrobial treatment data as these were not relevant to the analysis. Escalation of treatment was defined as a switch to a higher group antibiotic, and de-escalation was defined as switching to a lower group or discontinuation of the “stem” antibiotic while continuing with an additional coverage antibiotic.

Statistical analysis

Baseline characteristics are presented as raw numbers, without any weighting (S1 Appendix). The prespecified primary outcome was mortality through 28 days post-enrolment, analyzed using Kaplan–Meier and Cox proportional hazards regression with site-level random effects. Time was measured from the initial blood culture sample, censoring at the earliest of day 28, withdrawal or last contact if lost post-discharge. Aligned with the WHO pSBI criteria, we developed 2 prediction models and corresponding risk scores: (1) a baseline NeoSep Severity Score to predict 28-day mortality from factors known at sepsis presentation; and (2) a NeoSep Recovery Score to predict the daily risk of death while treated with IV antibiotics from daily updated assessments of clinical status.

NeoSep Severity Score.

We constructed a Cox regression model for 28-day mortality based on baseline clinical parameters known at sepsis presentation (i.e., before availability of microbiological results) and used this to develop a neonatal sepsis severity score. Baseline was defined as up to 24 h from the time blood cultures were taken. Candidate predictors selected for initial consideration had missing values in <10% of the included infants and had been found to be predictive in other studies [26] or were a priori determined to be clinically important and not highly correlated with other factors (S1 Table). Factors that are not usually available in low-income settings including laboratory results (missing in >10%), and clinical signs with prevalence <5% were not considered. All analyses were based on available data, and imputation methods were not used. A 15% randomly selected sample per site was reserved for model validation and not used in any model development. This validation procedure was predefined because of expected large differences between sites in mortality, instead of randomly choosing 1 or 2 sites for validation which then might have turned out to be unrepresentative. In the remaining 85% of infants, model development used backwards elimination (exit p = 0.05) to identify independent predictors of mortality, considering both categorical (e.g., presence of sepsis signs) and continuous predictors (birth weight, gestational age, time in hospital, age at baseline, oxygen saturation, respiratory rate, heart rate, and temperature), taking into account the potential for nonlinear relationships (e.g., where both high and low values are associated with poor outcomes) by using fractional polynomials (FPs). FPs were implemented using the mfp command in Stata with power (−2, −1, −0.5, 0, 0.5, 1, 2) and significance level of 0.05 for testing between FP models of different degrees. Initial variable selection was done on complete cases that comprised 2,313/2,726 (85%) infants in the training set; final models were re-fitted to complete cases for the included factors (2,705/2,726 (99%) infants in the derivation sample). Interactions between birth weight and clinical predictors were examined and 1 strong interaction (p < 0.001) was found with ventilation support; however, this was not included in the final model in favor of simplicity for use of the prediction model in clinical practice and because including additional points for the interaction made very little difference to the C-statistic. The proportional hazards assumption was checked based on Schoenfeld residuals, and no deviations were found (p > 0.05). A points-based risk score, where each predictor of death is assigned a number of points, was then developed from model coefficients: For each of the categorical factors, coefficients were divided by the smallest of the coefficients (lethargy) and rounded to the nearest integer. For the continuous factors, a clinically relevant reference value was chosen (birth weight: 3,000 g; temperature: 37°C; gestational age: 39 weeks; time in hospital: 14 days), and the number of points associated with lower/higher values was then based on the difference to the reference. This initial score was then further simplified to provide a more feasible and pragmatic scale for use in LMIC by assigning 1 score point for a predictor if its value in the initial score was <3, 2 score points if its initial value was 3 to 5 points, and 3 score points if its initial value was ≥6 points. For each participant, points were added with a higher score indicating a higher risk of death. Harrell’s C-index with bootstrapped confidence intervals was used as a measure of discrimination of the prognostic Cox models, and the Hosmer–Lemeshow goodness of fit test was applied after running a logistic model with death as the independent and the score as the dependent variable as a measure of calibration.

We compared the NeoSep Severity Score with a score based on WHO pSBI [19], allocating 1 point for each of the following 6 signs: fever (≥38°C), low body temperature (<35°C), movement only on stimulation, feeding poorly, fast breathing (≥60 breaths per minute on days 0 to 6), and severe chest in-drawing. Whereas the first 5 signs were directly reported/measured in NeoOBS, the latter was only reported as part of a composite respiratory sign (severe chest wall in-drawing, increased oxygen requirement, or need for ventilation); for the score calculation, 1 point was given if this was reported.

NeoSep Recovery Score.

To estimate the association between daily risk of death after initiating IV antibiotics and time-updated factors, and hence develop a recovery score, we used Cox proportional hazards regression with time-varying independent factors. We used the same samples as described above for model derivation and validation. Infants were censored after stop of all IV antibiotics (cause-specific model). We started model building with all clinical predictors included in the baseline severity score as time-updated factors; unmodifiable infant and birth characteristics were excluded because they cannot evolve and would have restricted interpretation of the recovery score (e.g., preterm babies would have a minimum score of 1 and could have never reached “full recovery” with a score of 0). Forward selection (entry p = 0.05) was then used to identify any additional independent time-updated clinical predictors. FPs were used for continuous factors as described above. For vital parameters, the last value was used per assessment period. To avoid selecting factors representing the mechanism of dying rather than being predictors of subsequent death, clinical parameters reported on the previous day were used as predictors of death on the current day in all time-updated models. A points-based risk score was then derived similarly to the baseline severity score as described above. Discrimination was assessed using time-updated area under the receiver operator curves (AUROCs), ignoring censoring as this was low (<2% overall). To examine the potential usefulness of the recovery score for informing the decision to potentially switch to second-line antibiotics in future empiric clinical trials, we focused on day 2 (48 to 72 h) post baseline, a key decisional time point during clinical management when culture results become available and response to treatment is commonly evaluated.

Results

A total of 3,204 infants (90.4% neonates aged <28 days, n = 2,895; 42.1% female, n = 1,348) were recruited from 20 August 2018 to 29 February 2020 (Table 1 and S2 Fig). Sites included varying populations of infants and levels of care (S3S6 Figs and S2 Table). The median postnatal age was 5 (IQR 1 to 15) days, and 3,088 (96.4%) infants had been born in a hospital/facility (1,550 in the enrolling facility), 1,412 (44.3%) by cesarean section (969 as an emergency), and 71 (2.2%) had previously been treated for an episode of culture-positive sepsis. The median (IQR) gestational age at birth was 37 (31 to 39) weeks, with birth weight 2,500 g (1,400 to 3,000 g). At enrollment, 69.1% (n = 2,215) infants had been hospitalized since birth, and 30.9% (n = 989) were admitted from the community. A total of 2,759 (86.1%) were recruited in a neonatal unit. Among 309 (9.6%) infants enrolled aged ≥28 days, the majority (n = 181; 58.6%) were either ex-premature (n = 146; 47.4%) and/or had been admitted during the neonatal period (n = 136, 44.2%).

The most common previously identified risk factors for sepsis other than prematurity were preterm premature rupture of membranes (14.5%, n = 466), prolonged rupture of membranes (>18 h) (10.2%, n = 328), pre-labor rupture of membranes at term (9.4%, n = 300), presence of an indwelling central vascular catheter (8.2%, n = 262), intrapartum fever >38°C (3.5%, n = 112), chorioamnionitis (2.8%, n = 75), known maternal Group B streptococcus (GBS) colonization (1.4%, n = 46) and previous surgery (abdominal or for congenital malformations: 3.0%, n = 95). Of note, 8.3% (n = 265) had a congenital anomaly (S3 Table), and 7% (n = 220) of infants were exposed to maternal HIV. A total of 1,318 (41.1%) sepsis episodes were classified as healthcare-associated (occurring >48 h after hospital admission).

Clinical and laboratory findings

A median of 4 (IQR 2 to 5) clinical signs were present at baseline, the most common being respiratory (65.8%, n = 2,107), difficulty feeding (45.7%, n = 1,464), lethargy or reduced movement (35.3%, n = 1,131), abdominal distension (24.3%, n = 777), and evidence of shock (21.3%, n = 683), with prevalence decreasing over time (Table 1 and S7 Fig). Signs suggestive of meningitis were reported in <10% of infants (irritability, convulsions, abnormal posturing, bulging fontanelle) (S8 Fig). The availability of laboratory values varied by site. At baseline, a base excess <-10 mmol/L was documented in 352 of 1,492 with results available (23.6%), lactate >2 mmol in 1,034/1,283 (80.6%), raised CRP >10 mg/L in 1,306/2,286 (57.1%), abnormal white blood cell count (<4 or >20 × 109 cells/L) in 875/2,800 (31.3%), and thrombocytopenia (<150 × 109 cells/L) in 619/2,776 (22.3%) (S4 Table).

Patterns of empiric antibiotic use

A total of 1,180 (36.8%) infants had a history of previous intravenous antibiotic exposure, and 742 (23.2%) had been receiving an intravenous antibiotic in the previous 24 h before starting new antibiotics for the new episode of sepsis (85.3% treatment (n = 633) and 14.2% prophylaxis (n = 105)). Median time from baseline culture being taken to new intravenous antibiotic treatment being started for the distinct sepsis episode was 1 h (IQR 0 to 3); 2,913 (90.9%) started within 8 h, and 228 (7.1%) within 8 to 24 h (S9 Fig); 63 (2.0%) infants did not start any new antibiotic in the first 24 h.

There were 206 different combinations of empiric antibiotics started at baseline in this hospital-based cohort. The most frequent regimens are reported in S5 Table, and these were grouped as described in the methods. Of 3,141 infants who started new antibiotics within 24 h, 25.9% (n = 814) were started on a WHO Neonatal Sepsis Guideline recommended first-line penicillin-based regimen (Group 1—Access), and 13.8% (n = 432) were started on a WHO second-line cefotaxime or ceftriaxone-based combination (Group 2—“Low” Watch) (Fig 1A). The largest group (34.0%, n = 1,068) were started on a regimen providing partial ESBL/pseudomonal coverage (piperacillin-tazobactam, ceftazidime, or fluoroquinolone-based) (Group 3—“Medium” Watch). Of Group 3 regimens, ceftazidime ± amikacin (n = 436; 13.9%) and piperacillin/tazobactam ± amikacin (n = 410; 13.1%) were most common. Approximately 18.0% (n = 566) of initial regimens were carbapenem-based (Group 4—“High” Watch), among whom meropenem ± vancomycin (n = 447; 14.2%) was the most common combination; 1.8% (n = 57) of initial regimens were classified as Group 5 antibiotics targeting CROs, predominantly colistin-based (Group 5—Reserve) (S10S14 Figs). An “Other” group (n = 204) consisted of more rarely used local regimens not on the WHO EMLc, or regimens which did not include a new antibiotic “stem” that was used to define Groups 1 to 5 (e.g., aminoglycoside or glycopeptide given alone or in combination with each other) (S15 Fig). Cefoperazone/sulbactam (n = 99) was the most common antibiotic in this category, but was used as initial regimen only in India (n = 86; 14.5%), China (n = 11; 1.9%), and Vietnam (n = 2; 1.0%).

thumbnail

Fig 1. Antibiotic treatment.

(A) Initial empiric baseline therapy overall and by site. (B) First change of initial regimen, by baseline regimen. (C) Cumulative incidence of stopping all IV antibiotics (death on IV antibiotics as competing risk). Group 1 = First-line WHO-recommended penicillin-based regimen (e.g., ampicillin and gentamicin) (Access). Group 2 = third-generation cephalosporin (e.g., cefotaxime/ceftriaxone)-based WHO regimens (“Low” Watch). Group 3 = regimens with partial anti-ESBL or pseudomonal activity (e.g., piperacillin-tazobactam/ceftazidime/fluoroquinolone-based) (“Medium” Watch). Group 4 = Carbapenems (“High” Watch). Group 5 = Reserve antibiotics targeting carbapenem-resistant organisms (e.g., colistin). ESBL, extended-spectrum beta-lactamase; IV, intravenous.


https://doi.org/10.1371/journal.pmed.1004179.g001

There was wide variation between sites in the frequency of empiric use of different antibiotic groups (Fig 1A). Some sites used predominantly Group 1 antibiotics as the initial regimen, and others often started immediately with Group 3 or 4 regimens, or a mixture of all groups. The most frequently prescribed empiric regimens used for HAIs were meropenem ± vancomycin (28.2%, 361/1,280), followed by piperacillin/tazobactam ± amikacin (17.5%, n = 224), ceftazidime ± amikacin (6.7%, n = 86), colistin (4.3%, n = 55), and cefoperazone/sulbactam (3.9%, n = 50). Ampicillin + gentamicin was the most common regimen for non-HAI (19.7%, 366/1,861), followed by ceftazidime ± amikacin (18.8%, n = 350) and piperacillin/tazobactam ± amikacin (10.0%, n = 186) (S6 Table).

Adjusting for site, predictors of starting empiric therapy with Groups 3 to 5 rather than Groups 1 to 2 antibiotics included lower birth weight (OR = 0.57 per additional kg, 95% CI 0.47 to 0.69), presence of a central vascular catheter (OR = 3.48, 95% CI 1.74 to 6.94), previous antibiotics at enrollment (OR = 5.71, 95% CI 3.73 to 8.77), previous culture positive sepsis (OR = 25.71, 95% CI 3.00 to 220.7), longer time in hospital (48 versus 0 hours: OR = 4.41, 95% CI 3.41 to 5.69), previous surgery (OR = 5.18, 95% CI 1.65 to 16.28), and higher sepsis severity (OR = 1.27 per additional score point, 95% CI 1.16 to 1.40) (see S7 Table).

Antibiotic switching

After initial therapy, 728/2,880 (25.3%) who started on Group 1 to 4 regimens were escalated to a higher group regimen, the majority switching within the first days of treatment (S16S18 Figs), and 258/814 (31.7%) infants escalated from Group 1 (ampicillin/gentamicin) regimens, the majority of whom switched to Group 2 (cefotaxime/ceftriaxone-based) regimens (61.6%, n = 159) (Fig 1B). A total of 101/432 (23.4%) infants escalated from Group 2 regimens, mostly directly to a Group 4 carbapenem-based regimen (62.4%, n = 63) rather than a Group 3 partial ESBL/pseudomonal activity regimen (32.7%, n = 33); 287/1,068 (26.9%) escalated from Group 3 regimens, and of 566 infants starting treatment with carbapenems (Group 4), 82 (14.5%) escalated therapy to a colistin-containing regimen. Common reported reasons for first escalation of antibiotics overall included clinical deterioration (65.9%, n = 480), microbiology results (15.4%, n = 112: n = 48 pathogen identification, n = 37 susceptibility, n = 27 gram stain), and worsening inflammatory biomarkers (9.8%, n = 71). De-escalation of antibiotics was rare (173/2,937; 5.9%).

Cumulative incidence of stopping intravenous treatment within 7 days from baseline culture was 38.9% (95% CI 37.2% to 40.6%) overall (Fig 1C); 45.8% (95% CI 43.8% to 47.6%) in pathogen-negative and 6.9% (95% CI 5.0% to 9.2%) in pathogen-positive cases. After stopping intravenous antibiotics, 350/2,803 (12.5%) switched to oral therapy, and 289 of 2,803 infants who had stopped (10.3%), restarted intravenous antibiotics during the original hospital stay, and a further 84 infants after discharge. A total of 115 (3.4%) were still on uninterrupted antibiotic treatment at day 28. Median total number of days on intravenous antibiotics during the 28 days follow-up was 8 (IQR 6 to 14) days (S19 Fig). Intramuscular use of antibiotics was only reported for 2 infants (<0.1%).

Microbiology

Initial blood culture results were available for 3,195 (99.7%) infants; 693 (21.7%) grew at least 1 organism. Organisms identified as significant pathogens were isolated in 564/693 (>1 pathogen in 29) blood cultures, contaminants (presumed non-pathogens) in 117, and indeterminate in 12 cultures (Table 1). Gram-negative and gram-positive pathogens were found in 62.9% (355/564) and 34.8% (196/564) of infants, respectively (n = 8 with both), and fungal pathogens in 21. Among infants with a significant pathogen, Klebsiella pneumoniae (23.4%, n = 132), Coagulase-negative staphylococci (CoNS) (14.9%, n = 84), Acinetobacter species (12.8%, n = 72), S. aureus (9.6%, n = 54), and Escherichia coli (8.3%, n = 47) were the most common (Table 1 and S20 Fig) but with differences between sites (S21 Fig). Streptococcus agalactiae was found in only 19 babies (3.4%). All the common pathogens were identified in both early and late onset sepsis (Fig 2), although E. coli was more common in the first 3 days of life while K. pneumoniae was more frequent in late onset sepsis (S8 Table).

Approximately 58% (75/130 tested) of K. pneumoniae isolates were resistant to gentamicin, 75.0% (96/128) to commonly used third-generation cephalosporins (cefotaxime/ceftriaxone), 46.5% (53/114) to piperacillin-tazobactam, 46.6% (48/103) to ciprofloxacin, and 32.6% (43/132) to meropenem (S9 Table). Acinetobacter species were resistant to meropenem in 71.4% (50/70) of cases. E. coli retained greater susceptibility to third-generation cephalosporins (64.4%, 29/45 susceptible) and was susceptible in 90.4% (38/42) of cases to piperacillin-tazobactam. Among gram-negatives, there were important differences in susceptibility among aminoglycosides, with amikacin providing significantly better activity than gentamicin (e.g., 61.5% versus 38.5% susceptibility among K. pneumoniae) (Fig 3 and S9 Table).

thumbnail

Fig 3. Antibiotic resistance among leading pathogens in baseline blood culture.

(A) K. pneumoniae (n = 132). (B) Acinetobacter spp. (n = 72). (C) E. coli (n = 47). pip-taz = piperacillin/tazobactam.


https://doi.org/10.1371/journal.pmed.1004179.g003

Among 54 S. aureus isolates, 33 (61.1%) were methicillin resistant. S. aureus and CoNS were susceptible to vancomycin in all of 42 and 73 isolates tested, respectively, and all 19 Streptococcus agalactiae isolates were susceptible to ampicillin. Other rarer and more site-specific pathogens with high rates of resistance to antibiotics included Serratia species, Burkholderia species, and Elizabethkingia meningoseptica (S9 Table).

A total of 1,226/3,204 (38.3%) infants had a cerebrospinal fluid (CSF) culture performed at baseline or during the subsequent 7 days, and 73/1,226 (6.0%) were culture-positive, 47 with a pathogen, and 26 contaminant/indeterminate. Gram-negative organisms also dominated in CSF cultures (S10 Table).

Mortality

Median follow-up time was 29 (IQR 28 to 29) days. Overall, 350 infants (11.3%; 95% CI 10.2% to 12.5%) died within 28 days of baseline blood culture. Mortality at 28 days was unknown in 62 infants (5 withdrew in hospital, 57 were lost post-discharge). There was significant variation in mortality between different sites (Fig 4A). Mortality among infants with a pathogen-positive baseline culture was 17.7% (99/564; 95% CI 14.7% to 21.1%) compared with 9.9% (250/2,631; 95% CI 8.8% to 11.2%) in infants without pathogens (p < 0.001). Mortality was higher in infants with gram-negative (21.3%; 95% CI 17.4% to 25.9%) or fungal pathogens (38.1%; 95% CI 21.2% to 61.9%) than with gram-positive pathogens (8.5%; 95% CI 5.3% to 13.5%; p < 0.001). Mortality was 33.3% (95% CI 23.7% to 45.5%) in infants with Acinetobacter spp., 21.5% (95% CI 15.4% to 29.6%) with K. pneumoniae, 21.1% (95% CI 8.5% to 46.8%) with Streptococcus agalactiae, 12.8% (95% CI 6.0% to 26.3%) with E. coli, 11.1% (95% CI 5.2% to 23.1%) with S. aureus, and 3.6% (95% CI 1.2% to 10.8%) with CoNS (Fig 4B and S11 Table). Overall, gram-negative infections accounted for 75/99 (75.8%) of culture-positive deaths, especially K. pneumoniae (n = 28; 28.3%) and Acinetobacter spp. (n = 24; 24.2%).

There was no significant association between time from blood culture to start of new antibiotics and mortality, neither overall (HR = 1.01 per additional 2 h [95% CI 0.96 to 1.07]; p = 0.60) nor in the subgroup with no previous antibiotic exposure (HR = 1.03 [95% CI 0.95 to 1.11]; p = 0.53) (S22 Fig).

Baseline predictors of 28-day mortality

A prediction model was derived from 2,726 participants (n = 308 died) and validated in 478 (n = 42 died) (S12 and S13 Tables). Ten clinical factors known at presentation independently predicted mortality in the final model, including infant characteristics (birth weight, gestational age, duration of time in hospital, and congenital anomalies), level of respiratory support, and clinical signs (abnormal temperature, abdominal distension, lethargy/no or reduced movement, difficulty feeding, and evidence of shock) (Table 2). Risk was increased with both low (<35°C) and high temperature (≥38°C) with evidence of even higher risk if ≥39°C. The C-statistic for this model was 0.79 (95% CI: 0.77 to 0.81) and 0.81 (0.73 to 0.85) in the derivation and validation sample, respectively. A NeoSep Severity Score developed from these baseline predictors had a maximum 16 points (Table 2), with C-statistics 0.77 (95% CI: 0.75 to 0.80) and 0.76 (0.69 to 0.82) in the derivation and validation samples, respectively, and a good fit in the validation sample (Hosmer–Lemeshow p = 0.53). Defining mortality risk thresholds at 5% and 25% in the derivation sample categorized scores as low (score 0 to 4), medium (5 to 8), and high (9 to 16) risk, with mortality in the derivation sample of 1.6% (17/1,071; 95% CI: 1.0% to 2.5%), 14.4% (203/1,409; 12.7% to 16.3%), and 35.8% (88/246; 30.0% to 41.9%), and in the validation sample of 1.6% (3/189; 95% CI: 0.5% to 4.6%), 11.0% (27/245; 7.7% to 15.6%), and 27.3% (12/44; 16.3% to 41.8%) (Fig 5A).

thumbnail

Fig 5. NeoSep Severity Score at baseline.

(A) Distribution of the Severity Score at baseline (bottom), and the proportion (95% CI) of infants who eventually died within 28 days per score point (top) in the derivation (dark gray) and in the validation dataset (light gray). (B) Mortality (95% CI) in risk groups based on the Severity Score and selected sepsis subgroups. (C) Mortality (95% CI) in risk groups based on the Severity Score and region.


https://doi.org/10.1371/journal.pmed.1004179.g005

The association between the NeoSep Severity Score and mortality was similar within multiple subgroups, for example, in early or late-onset, community or healthcare-associated, high-, middle- or low-income settings (Fig 5B and 5C), term versus preterm infants, blood culture-positive versus culture-negative cases (S23 Fig).

A score based on WHO pSBI criteria had a C-statistic of 0.63 (95% CI 0.56 to 0.70) in the validation sample, lower than the NeoSep Severity Score. Because the WHO pSBI criteria do not include any infant/birth characteristics, to ensure an appropriate comparison, we also calculated a modified NeoSep Severity Score excluding these factors that had a C-statistic of 0.72 (95% CI 0.65 to 0.77) (S24 Fig).

Predictors of daily risk of dying of sepsis while on intravenous antibiotics in hospital: NeoSep Recovery Score

Seven of the time-varying factors independently predicted mortality in the final model, including all the non-modifiable signs in the NeoSep Severity Score (respiratory support and the 5 clinical signs), plus cyanosis, which had not added independent information in the baseline model (p = 0.228) (Table 2). A NeoSep Recovery Score developed from these time-updated predictors had a maximum 11 points and discriminated well between infants who died or survived the next day (Fig 6A and 6B). The AUROC of the recovery score on one day, for predicting death the next day, ranged between 0.8 and 0.9 over the first week post baseline in the derivation sample (Fig 7A). As expected, it slowly decreased over subsequent days. AUROC over time in the validation sample was similar although based on fewer numbers (Fig 7B).

thumbnail

Fig 6. Distribution of the NeoSep Recovery Score.

Distribution of the Recovery Score, based on clinical signs in the last 24 h, in the derivation (A) and validation (B) data. Numbers in parentheses: number of infants per group. s = survived, d = died the next day. Infants who died on same day are excluded for the relevant time points. The violin plots show the distribution of the Recovery Score as density plots plus median, interquartile range, and upper- and lower-adjacent values in infants with a score > 0.


https://doi.org/10.1371/journal.pmed.1004179.g006

thumbnail

Fig 7. NeoSep Recovery Score: Time-updated area under the ROC curve.

Time-updated area under the ROC curve (AUROCC) plus 95% CI in the derivation (A) and validation (B) data. Lines represent the trend in predictive value over time of the recovery score on a particular day for death in subsequent days. AUC (+ 95% CI) = area under the curve (+ 95% confidence interval), ROC = receiver operating characteristic.


https://doi.org/10.1371/journal.pmed.1004179.g007

The NeoSep Recovery Score on day 2 had an AUROC for dying in the following 5 days of 0.82 (95% CI 0.78 to 0.85) and 0.85 (95% CI 0.78 to 0.93) in the derivation and validation samples, respectively. A score ≥4 was the most discriminative, whether this was an increase from baseline, lack of initial response, or an improvement from a higher score down to 4, with sensitivity and specificity of 0.74 (95% CI 0.64 to 0.82) and 0.74 (0.72 to 0.75) in the derivation, and 0.87 (0.60 to 0.98) and 0.76 (0.71 to 0.79) in the validation samples.

Twenty-nine infants (derivation: n = 27; validation: n = 2) died between day 3 and 7 despite having had a score <4 on day 2. Of these, 6 deaths were not classified as infection-related, 10 had a subsequent increase in score to ≥4 before dying, 9 had at least 2 unmodifiable risk factors included as predictors of mortality in the NeoSep Severity Score (most commonly congenital abnormalities and very low birth weight), and 1 had congenital varicella; the remaining 4 had a day 2 score of 2 or more.

Of note, the change in score from baseline to day 2 had poorer discrimination than the absolute score on day 2, with an AUROC for dying in the following 5 days of 0.68 (95% CI 0.63 to 0.73) and 0.62 (95% CI 0.48 to 0.77) in the derivation and validation samples, respectively. Combining change in score using various cut-offs with <4/≥4 absolute score on day 2 also did not improve discrimination (S26 Fig).

Discussion

In this large hospital-based observational study in 11 countries across Africa, Asia, Europe, and Latin America, we observed wide variation in antibiotic prescribing beyond WHO recommendations for neonates and young infants with sepsis, with over 200 different empiric combinations, frequent empiric escalation, and rare de-escalation of therapy. Approximately 10% of pathogen-negative and 18% of pathogen-positive infants died. Gram-negative infections were associated with the majority of pathogen-positive deaths, and leading pathogens such as K. pneumoniae, Acinetobacter spp., and S. aureus were often resistant to WHO regimens. Mortality was independently associated with a range of factors, developed into a sepsis severity score which discriminated well between survivors and non-survivors at baseline. A recovery score was developed accounting for evolving clinical signs after baseline in order to assess response to antibiotics and also performed well at predicting death the following day. The NeoOBS study highlights the urgent need for clinical trials to inform antibiotic use for neonatal sepsis globally, describes the standard of care (SOC) against which novel antibiotic regimens should be compared, and provides severity and recovery scores that can be used for inclusion and antibiotic escalation criteria in future trial design.

Our data on antibiotic use are similar to recent point prevalence surveys in LMIC settings [16], but this study provides evidence on patterns of switching and escalation of therapy that is rarely available in the literature. A third of infants that had started on WHO-recommended regimens escalated to broader spectrum antibiotic regimens, and escalation overall was most commonly to a carbapenem. There was also considerable use of “carbapenem-sparing” regimens with “partial” ESBL and/or antipseudomonal activity that varied between centers (piperacillin-tazobactam, ceftazidime, and quinolone-based regimens, often in combination with amikacin). Evidence on carbapenem-sparing regimens is limited in neonates [27], but Pseudomonas infections were rare in NeoOBS and in the recent published literature [5,6]. Notably, a small but important proportion of infants in some sites received treatment for proven or suspected carbapenem-resistant infection with colistin, for which a recommended approach to dosing and combination therapy remains unclear, and CSF penetration and side effect profile are suboptimal [28]. Some countries are also using less widely used combinations such as cefoperazone/sulbactam, for which neonatal data are limited [29,30]. There was almost no de-escalation of therapy.

AMR was common and gram-negative pathogens (Klebsiella and Acinetobacter) were largely resistant to WHO-recommended regimens. Among gram positives, MRSA accounted for over half of S. aureus isolates. However, the extent to which increasing prevalence of AMR is associated with excess mortality, and whether this may be modifiable with different antibiotic treatment strategies is unclear in neonates and young infants. AMR has been demonstrated as an independent predictor of mortality in adults with bloodstream infections, including in LMIC settings [31]. However, data in neonates and young infants in many LMIC settings are scarce [15,32]. A limited number of largely retrospective observational studies have shown an increase in mortality particularly in association with resistant gram-negative infections [3336], such as those due to organisms producing ESBL [3740] and CROs [4144], and some suggest an increase in mortality in the absence of appropriate antimicrobial therapy [33,4548]. A recent large multicenter neonatal sepsis study (BARNARDS) demonstrated high resistance to ampicillin + gentamicin (97% and 70%, respectively) among gram-negative infections; however, analysis of the influence of antibiotic treatment on outcomes was confounded by country effects, and limited clinical data prevented adjusting for other important confounders [9]. Appropriate analyses of the causal relationship between discordant antibiotic treatment and mortality would need to consider baseline confounders as well as time-dependent confounding [49], but to our knowledge, this has not been done for neonatal sepsis in LMIC settings to an extent that could inform changes to global guidance [9,36,43,50,51]. Relevant analyses in the NeoOBS cohort are ongoing. Ultimately, evidence from randomized controlled trials is required to determine the most appropriate antibiotic regimens in different contexts.

The NeoSep Severity and Recovery Scores provided good discrimination. A score of 5 or higher at baseline was associated with 28-day mortality over 10%, and a recovery score 2 days after antibiotic initiation of 4 or higher had both sensitivity and specificity of 74% for mortality over the following 5 days. The NeoSep scores include clinical signs that have been designed to closely align with the WHO pSBI criteria (compared in S14 Table) used in recent community-based studies, including AFRINEST [52] and SATT [53,54]. However, the NeoSep scores include additional factors more relevant to hospital settings which offer further predictive value, such as clinical evidence of shock, and need for oxygen and/or respiratory support. Importantly, the study population also included a more heterogenous mix of both preterm (including <1,500 g) and term infants, with early and late onset, and community and healthcare-associated sepsis, and the Severity and Recovery Scores performed similarly across all these subgroups. Of note, convulsions were uncommon in the study (7%) and were not associated with mortality despite being a pSBI criteria for critical illness, potentially due to low power. Previous severity scores derived from hospital-based LMIC cohorts have been based on smaller studies assessing general illness severity rather than sepsis specifically [5560]. High-income setting-based general illness severity scores are more widely used [5557], and some have been assessed in small populations of septic neonates, but they are often unfeasible to apply globally [61,62]. The nSOFA score is a recent sepsis-specific score developed on 60 very low birth weight (<1,500 g) infants, but relies on measurement of parameters that are often unavailable in LMIC settings (e.g., use of inotropes/vasoactive drug use) [63]. The NeoSep scores are based on clinical variables that are relevant and feasible to collect globally across a range of LMIC hospital settings.

There are limitations to the generalizability of the data. The sites were selected to be heterogenous, varying in patient volume and case-mix, and with wide variation in levels of supportive care. Sites also varied in their ability to screen, using different screening schedules depending on staff levels and number of patients, particularly during nights or weekends. Given the very large numbers of infants presenting to many of the sites (see S1 Table), including out of hours, project funding was not available to prospectively screen and record the status of all admitted infants and reasons for not being enrolled. We would note, however, that the design enrolled participants who could feasibly be recruited to an antibiotic trial. Most sites in NeoOBS represented secondary or tertiary hospitals in urban settings, in some cases receiving referrals after prior treatment in other hospitals. In many cases, these facilities provided a higher level of care than is typical in many low-resource settings. AMR may be more important and a wider range of antibiotics may be available in some of the settings in this study compared to district hospital settings [64,65]. This is an important bias in most studies on AMR in low-resource settings, where the need for high-quality microbiology means certain settings are overrepresented.

A significant proportion (36.8%) of infants had previously received antibiotics before the new sepsis episode, which may influence culture yield and pathogen characteristics. Selection bias was also possible beyond site selection, with prospective recruitment potentially leading to a milder phenotype as infants who die rapidly with sepsis are more difficult to enroll, possibly leading to underestimation of mortality. There was wide variation in mortality between sites, highlighting major limitations of basing antibiotic recommendations on observational data alone. Analyses to determine the impact of antibiotic use on outcomes are confronted with important biases [49], including the influence of initial sepsis severity on antibiotic choice and timing of first administration, antibiotic availability and affordability, inter-site population heterogeneity, local guidelines and microbiology, and varying levels of supportive care.

In developing the NeoSep Severity and Recovery Scores, we attempted to account for inter-site variation by using multivariable models with site-level random-effects, excluding factors with a high proportion of missing values. We also examined performance of the Severity Score across different sites in high-, middle-, and low-income contexts and demonstrated reproducible risk trends across all settings. Of note, the scores were validated on a predefined reserved 15% internal sample, including only 42 deaths (and relatively few infants with high-risk scores). External validation would strengthen the applicability of the scores.

In the context of increasing resistance to WHO-recommended therapy for neonatal sepsis in LMIC settings, and a lack of evidence to guide optimal management due to the limitations of observational data, further randomized antibiotic trials are urgently needed. This study has demonstrated that there is no single accepted “standard of care” in neonatal sepsis in LMIC hospital settings, and any novel regimens will need to be compared to multiple widely used empiric regimens including current WHO-recommended guidance. NeoOBS data, including the severity and recovery scores, have informed the design of the NeoSep1 trial (ISRCTN 48721236) which will use a network meta-analytic approach to rank novel off-patent antibiotic combinations compared to WHO-recommended and other commonly used regimens, combined with a SMART (Sequential Multiple Assessment Randomized Trial) design to allow randomization to both empiric first- and second-line treatment [66].

Supporting information



Source link