Skip to content
/ hail Public
forked from hail-is/hail

Cloud-native genomic dataframes and batch computing

License

Notifications You must be signed in to change notification settings

grohli/hail

This branch is 73 commits behind hail-is/hail:main.

Folders and files

NameName
Last commit message
Last commit date
Oct 15, 2024
Aug 7, 2024
Aug 7, 2024
Oct 11, 2024
Sep 16, 2023
Oct 11, 2024
Oct 18, 2024
Sep 15, 2023
Oct 17, 2024
Jun 18, 2024
Oct 11, 2024
May 13, 2024
Oct 11, 2024
Jan 3, 2024
Sep 3, 2020
Dec 7, 2023
Oct 18, 2024
Oct 21, 2024
May 23, 2024
Dec 7, 2023
Mar 1, 2024
Jan 23, 2024
Apr 11, 2024
Sep 16, 2023
Oct 11, 2024
Jul 22, 2024
Nov 25, 2020
Jun 18, 2024
Mar 20, 2024
Feb 6, 2024
Jan 11, 2024
Jun 21, 2020
Apr 21, 2016
Jun 28, 2022
Jan 3, 2023
Jul 9, 2024
Jun 27, 2022
Oct 11, 2024
Oct 11, 2023
Apr 26, 2024
Jan 3, 2024
Oct 27, 2023
Apr 26, 2024
Dec 1, 2023
Mar 26, 2024

Repository files navigation

Hail

Zulip DOI PyPI version

Hail is an open-source, general-purpose, Python-based data analysis tool with additional data types and methods for working with genomic data.

Hail is built to scale and has first-class support for multi-dimensional structured data, like the genomic data in a genome-wide association study (GWAS).

Hail is exposed as a Python library, using primitives for distributed queries and linear algebra implemented in Scala, Spark, and increasingly C++.

See the documentation for more info on using Hail.

Community

Hail has been widely adopted in academia and industry, including as the analysis platform for the genome aggregation database and UK Biobank rapid GWAS. Learn more about Hail-powered science.

Contribute

If you'd like to discuss or contribute to the development of methods or infrastructure, please:

Hail uses a continuous deployment approach to software development, which means we frequently add new features. We update users about changes to Hail via the Discussion Forum. We recommend creating an account on the Discussion Forum so that you can subscribe to these updates as well.

Maintainer

Hail is maintained by a team in the Neale lab at the Stanley Center for Psychiatric Research of the Broad Institute of MIT and Harvard and the Analytic and Translational Genetics Unit of Massachusetts General Hospital.

Contact the Hail team at hail@broadinstitute.org.

Citing Hail

If you use Hail for published work, please cite the software. You can get a citation for the version of Hail you installed by executing:

import hail as hl
print(hl.citation())

Which will look like:

Hail Team. Hail 0.2.13-81ab564db2b4. https://github.com/hail-is/hail/releases/tag/0.2.13.
Acknowledgements

The Hail team has several sources of funding at the Broad Institute:

  • The Stanley Center for Psychiatric Research, which together with Neale Lab has provided an incredibly supportive and stimulating home.
  • Principal Investigators Benjamin Neale and Daniel MacArthur, whose scientific leadership has been essential for solving the right problems.
  • Jeremy Wertheimer, whose strategic advice and generous philanthropy have been essential for growing the impact of Hail.

We are grateful for generous support from:

  • The National Institute of Diabetes and Digestive and Kidney Diseases
  • The National Institute of Mental Health
  • The National Human Genome Research Institute
  • The Chan Zuckerberg Initiative

We would like to thank Zulip for supporting open-source by providing free hosting, and YourKit, LLC for generously providing free licenses for YourKit Java Profiler for open-source development.

About

Cloud-native genomic dataframes and batch computing

Resources

License

Citation

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 49.3%
  • Scala 43.7%
  • Jupyter Notebook 1.7%
  • HTML 1.4%
  • C++ 1.2%
  • HCL 0.7%
  • Other 2.0%