Research Themes +
Energy Technologies Area (ETA) researchers are continually building on the strong scientific foundation we have developed over the past 50 years. We address the world’s most pressing climate challenges by bringing to market energy-efficient innovations across the buildings, transportation, and industrial sectors. ETA is at the forefront of developing better batteries for electric vehicles; improving the country's aging electrical grid and innovating distributed energy and storage solutions; developing grid-interactive, efficient buildings; and providing the most comprehensive market and data analysis worldwide for renewable technologies like wind and solar.
Strategic Initiatives +
The Energy Technologies Area (ETA) Strategic Plan is the guiding force for our research and development for the next ten years. It clearly charts a path toward clean-energy solutions and focuses on five detailed Strategic Initiatives. The Plan provides an in-depth look at how ETA is accelerating research to provide affordable, clean energy to all while accomplishing deep, economy-wide decarbonization, looking to avoid a rise in global average temperature while simultaneously developing solutions to increase humanity's resilience to extreme weather volatility.
Publications
News +
For media inquiries,
please contact ETA
Interim Communications Manager
Kiran Julin

kjulin@lbl.gov
About Us +
The Energy Technologies Area (ETA) is unique in translating fundamental scientific discoveries into scalable technology adoption. Our approach combines an understanding of the marketplace and the role of state and federal regulation and policies. ETA's research drives real-world, practical results that affect and improve the everyday lives of Americans and those across the globe. Saving energy and battling the Climate Crisis are key to the foundation of our research, which is driven by technoeconomic analysis and in-lab experimentation and discovery.

Unsupervised word embeddings capture latent knowledge from materials science literature

Publication Type

Journal Article

Date Published

07/2019

Authors

Tshitoyan, Vahe, John Dagdelen, Leigh Weston, Alexander Dunn, Ziqin Rong, Olga Kononova, Kristin A Persson, Gerbrand Ceder, Anubhav Jain

DOI

10.1038/s41586-019-1335-8

Abstract

The overwhelming majority of scientific knowledge is published as text, which is difficult to analyse by either traditional statistical analysis or modern machine learning methods. By contrast, the main source of machine-interpretable data for the materials research community has come from structured property databases1,2, which encompass only a small fraction of the knowledge present in the research literature. Beyond property values, publications contain valuable knowledge regarding the connections and relationships between data items as interpreted by the authors. To improve the identification and use of this knowledge, several studies have focused on the retrieval of information from scientific literature using supervised natural language processing3,4,5,6,7,8,9,10, which requires large hand-labelled datasets for training. Here we show that materials science knowledge present in the published literature can be efficiently encoded as information-dense word embeddings11,12,13 (vector representations of words) without human labelling or supervision. Without any explicit insertion of chemical knowledge, these embeddings capture complex materials science concepts such as the underlying structure of the periodic table and structure–property relationships in materials. Furthermore, we demonstrate that an unsupervised method can recommend materials for functional applications several years before their discovery. This suggests that latent knowledge regarding future discoveries is to a large extent embedded in past publications. Our findings highlight the possibility of extracting knowledge and relationships from the massive body of scientific literature in a collective manner, and point towards a generalized approach to the mining of scientific literature.

Journal

Nature

Volume

571

Year of Publication

2019

Issue

7763

ISSN

0028-0836

Organization

Applied Energy Materials Group, Energy Storage and Distributed Resources Division