What is metRef?

It's a database where measures of genomic complexity metrics are collected on genomes from the RefSeq database.

What genomes are found in the database?

Not all RefSeq genomes can be found in the database. The RefSeq genomes are filtered so that the ones present:
Have a RefSeq category of "representative" or "reference". Are not excluded from RefSeq. Have an assembly level of "Complete genome" or "Chromosome" Belong to a defined group and they are not viruses.

Versioning

The database has its own versioning system. From time to time a new version is established where the genomes in the database are updated depending on the genomes available in RefSeq. Previous versions are kept in the database, to be able to perform subsequent studies on the same version.

What variables are stored?

Two metrics are stored in the database:
Biobit: The BB metric is a logistic map that balances a genome's entropic and anti-entropic components. BB compares the true genome with a random equifrequent one with the same length.
https://www.nature.com/articles/srep28840
Genomic Signature: The GS metric focuses on the k-mers content of a given genome.
https://digital.csic.es/handle/10261/225017
Several genomic parameters are stored in the database. They include:
Percentage of hapaxes. Calculation of entropy. Genome size. ACGT content. GC content. Number of annotated genes. Number of protein coding genes. Number of non coding genes.

If you are interested, please login or register to use the site.