Data-driven analysis and visualization of dielectric properties curated from scientific literature
Tomoki Murata, Naoto Saito, Eiji Koyama, T. Phuong, Ryusuke Misawa, Satoshi Yokomizo, Tomoya Mato, Yu Takada, S. Hirose, Yukari Katsura
Centered on 230,000+ experimental data points extracted from paper plots — complemented by crystal structure simulators, elemental reactivity maps, and other open tools for Materials Informatics research.
Starrydata is an open data project that extracts and structures experimental data from graphs in inorganic materials science papers, making them freely reusable for researchers worldwide.
Founded at NIMS in 2017, Starrydata is now operated by a dedicated research team in collaboration with RIKEN AIP, the University of Tsukuba and others. High-quality experimental data — useful for Materials Informatics, machine learning, and new-material discovery — are freely available via API, Figshare, and GitHub.
104,823 samples and 233,061 curves from 13,000+ papers. Numerical data carefully extracted from plots in literature, ready for machine learning and data analysis.
Beyond the database itself, we publish multiple free web tools for composition search, big-data visualization, plot digitization, and more.

Main database
Large open database with 200,000+ curves. Searchable and downloadable.

Big-data overview
Overview plots covering the entire Starrydata.

Composition-based search
Browse and filter samples by chemical composition.

3D thermoelectric plots
Interactive 3D plots of thermoelectric property distributions.

Extract data from plot images
Digitize plot images from published papers into numerical data.

Automated dataset summaries
View automatically generated summaries of the Starrydata dataset.

Crystal structure simulator
Interactive simulation and visualization of crystal structures.

Map of elemental reactivity
Visual map of reactivity between elements.

XRD plot generator
Generate and visualize Debye-Scherrer X-ray diffraction plots.
Use the data, contribute data, partner on research, or support the project — there are many ways to get involved.
Free search and download via the web system — instantly usable for ML and materials discovery.
Try Starrydata2 →Publish your group's experimental data through Starrydata. Boost the visibility and reuse of your research.
Get in touch →We welcome joint research and contract research. Let's advance Materials Informatics together.
Contact us →Seven research fields powering data curation and Materials Informatics studies.
Top 3 of 8 project papers from the Starrydata team. 184 external publications cite our work.
Tomoki Murata, Naoto Saito, Eiji Koyama, T. Phuong, Ryusuke Misawa, Satoshi Yokomizo, Tomoya Mato, Yu Takada, S. Hirose, Yukari Katsura
Yukari Katsura, Masaya Kumagai, Tomoya Mato, Yu Takada, Yuki Ando, Erina Fujita, Fumikazu Hosono, Eiji Koyama, Farhan Mudasar, T. Phuong, Naoto Saito, Yoshihiro Sakamoto, Atsumi Tanaka, Dewi Yana, Kaoru Kimura, Koji Tsuda, Masahiko Demura
Yukari Katsura, Tomoya Mato, Yu Takada, Eiji Koyama, Dewi Yana, Atsumi Tanaka, Masaya Kumagai
Starrydataのデータセットのスキーマを刷新したので共有します。 過去のスキーマ(Version 1.0、以下v1)のデータセットはGithub、新しいスキーマ(Version 2.0、以下v2…
Starrydataプロジェクトでは、論文中のグラフから実験データを抽出して公開しています。このとき、論文の中のすべてのグラフを集めようとすると膨大な時間がかかってしまうので、研究プロジェクトの目的に…
Materials Informatics(MI)はデータ科学を活用して新材料を開発するという 新しい研究分野です。そんな中、私達のStarrydataプロジェクトに関心を持っていただき、共同研究を申…
物質・材料研究機構(NIMS)主任研究員の桂ゆかりです。Starrydataと名付けた論文データ収集プロジェクトや、CRESTの大規模新物質探索プロジェクトなどいろいろな研究活動をしておりますが、情報…
Starrydataプロジェクトへの思いについて、noteに投稿しました。 https://note.com/yukarikatsura/n/n7191a553d3e4
We welcome data provision, joint research, and contract research. Let's advance Materials Informatics together.
Get in touch →Starrydata runs as a dedicated team of researchers and engineers in materials science, data science, and web development.
Funders and Partners