1 Anesthesiology and Pain Medicine , The Ottawa Hospital, Ottawa Hospital Research Institute , Ottawa , Canada 2 Clinical Epidemiology Program , Ottawa Hospital Research Institute , Ottawa , Canada 3 University of Ottawa Faculty of Medicine , Ottawa , Canada 4 Department of Health Sciences, Faculty of Science , Carleton University , Ottawa , Canada 5 Division of General Surgery , St. Michael's Hospital , Toronto , Canada 6 Division of General Surgery , St Michael’s Hospital , Toronto , Canada Correspondence to Dr Sylvain Boet, Anesthesiology and Pain Medicine, Ottawa Hospital, Ottawa Hospital Research Institute, Ottawa, Canada; sboet{at}toh.ca Abstract Background Educational interventions to improve teamwork in crisis situations have proliferated in recent years with substantial variation in teamwork measurement. This systematic review aimed to synthesise available tools and their measurement properties in order to identify the most robust tool for measuring the teamwork performance of teams in crisis situations. Methods Searches were conducted in Embase (via OVID), PsycINFO, Cumulative Index to Nursing and Allied Health Literature, Education Resources Information Center, Medline and Medline In-Process (via OVID) (through 12 January 2017). Studies evaluating the measurement properties of teamwork assessment tools for teams in clinical or simulated crisis situations were included. Two independent reviewers screened studies based on predetermined criteria and completed data extraction. Risk of bias was assessed using the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist. Results The search yielded 1822 references. Twenty studies were included, representing 13 assessment tools. Tools were primarily assessed in simulated resuscitation scenarios for emergency department teams. The Team Emergency Assessment Measure (TEAM) had the most validation studies (n=5), which demonstrated three sources of validity (content, construct and concurrent) and three sources of reliability (internal consistency, inter-rater reliability and test–retest reliability). Most studies of TEAM’s measurement properties were at no risk of bias. Conclusions A number of tools are available for assessing teamwork performance of teams in crisis situations. Although selection will ultimately depend on the user’s context, TEAM may be the most promising tool given its measurement evidence. Currently, there is a lack of tools to assess teamwork performance during intraoperative crisis situations. Additional research is needed in this regard. teamwork