lmfdb-lite

A Python interface to the L-functions and modular forms database (lmdfb.org). It uses the psycodict library, which provides a dictionary-based approach to interacting with SQL databases and is in turn built upon psycopg2.

Install

Choose one of the following options (pgsource has additional requirements such as a C compiler and Python and libpq header files; see here for more details):

pip3 install -U "lmfdb-lite[pgbinary] @ git+https://github.com/roed314/lmfdb-lite.git"

or

pip3 install -U "lmfdb-lite[pgsource] @ git+https://github.com/roed314/lmfdb-lite.git"

Usage

>>> from lmf import db
>>> sorted(set(name.split("_")[0] for name in db.tablenames))
['artin', 'av', 'belyi', 'bmf', 'char', 'cluster', 'data', 'ec', 'fq', 'g2c', 'gps', 'halfmf', 'hecke', 'hgcwa', 'hgm', 'hmf', 'hmsurfaces', 'inv', 'lat', 'lf', 'lfunc', 'maass', 'mf', 'modcurve', 'modlgal', 'modlmf', 'nf', 'noncong', 'pg', 'quaternion', 'shimcurve', 'shimura', 'smf', 'test', 'weil']
>>> db.gps_conj_classes.search_cols
['aut_label', 'centralizer', 'counter', 'group_counter', 'group_order', 'label', 'order', 'powers', 'representative', 'size']

A web-based view of the LMFDB's available tables is available here; clicking on a table name gives the first few records in that table as well as a link to the schema for the table.

Searching

The search method is one of the two main public interfaces for performing SELECT queries, intended for finding multiple results. It takes as input a query dictionary (see the Queries section below for details) and a projection (a column name or list of column names) and returns an iterator as output. Each entry in the output is either the value of the column (if a single column was specified) or a dictionary with column names as keys.

>>> list(db.gps_groups.search({"order": 8}, "name"))
['C8', 'C2*C4', 'D4', 'Q8', 'C2^3']
>>> list(db.ec_curvedata.search({"rank": 1}, ["lmfdb_label", "conductor", "ainvs"], limit=5))
[{'lmfdb_label': '37.a1', 'conductor': 37, 'ainvs': [0, 0, 1, -1, 0]},
 {'lmfdb_label': '43.a1', 'conductor': 43, 'ainvs': [0, 1, 1, 0, 0]},
 {'lmfdb_label': '53.a1', 'conductor': 53, 'ainvs': [1, -1, 1, 0, 0]},
 {'lmfdb_label': '57.a1', 'conductor': 57, 'ainvs': [0, -1, 1, -2, 2]},
 {'lmfdb_label': '58.a1', 'conductor': 58, 'ainvs': [1, -1, 0, -1, 1]}]

If only a single result is required, you can also use the lucky method, which takes similar inputs.

>>> db.lf_fields.lucky({"galois_label":"4T4"}, ["p", "n", "coeffs"])
{'p': 2, 'n': 4, 'coeffs': [2, 0, 2, 2, 1]}

If you know the label for the object you can use the lookup method.

>>> db.nf_fields.lookup("6.0.9747.1", "regulator")
0.601543105945

Random objects

You can get a random object from a table (the projection defaults to the label in this case).

>>> db.gps_groups.random({"order": 256})
'256.45342'
>>> db.gps_groups.random({"order": 256}, ["label", "exponent", "nilpotency_class", "aut_order", "center_label"])
{'label': '256.48589', 'exponent': 4, 'nilpotency_class': 2, 'aut_order': 32768, 'center_label': '8.5'}

If you want many random objects, random_sample can be more efficient (different modes have different efficiency/randomness tradeoffs).

>>> from collections import Counter
>>> Counter(db.gps_groups.random_sample(0.01, {"order": 256}, "nilpotency_class"))
Counter({2: 325, 3: 236, 4: 18, 5: 4, 6: 2})
>>> Counter(db.gps_groups.search({"order": 256}, "nilpotency_class"))
Counter({1: 22, 2: 31742, 3: 21325, 4: 2642, 5: 320, 6: 38, 7: 3})

Statistics

Basic statistical quantities are available through built-in functions; of course you can compute more complicated statistics in Python.

>>> db.mf_newforms.count()
1141508
>>> db.mf_newforms.count({"weight": 1})
19306

>>> db.ec_curvedata.max("rank")
5
>>> db.ec_curvedata.max("rank", {"torsion_order": 8})
2

>>> db.nf_fields.min("regulator")
0.205216461048

>>> db.g2c_curves.distinct("end_alg")
['CM', 'M_2(Q)', 'Q', 'Q x Q', 'RM']

>>> db.ec_nfcurves.count_distinct("torsion_order")
29

>>> db.ec_curvedata.sum("rank") / db.ec_curvedata.count()
0.7829141621160285

Queries

Many methods (most notably search and lucky) take dictionaries that specify the WHERE clause in an SQL query, determining which rows of the table are returned. The query language used in these dictionaries does not allow for arbitrary criteria, but instead a more limited set of clauses tailored to support the LMFDB's search webpages. See the query language specification for more details.

Using SQL directly

You can run your own SQL commands directly using the underlying psycopg2 engine. This can be useful if you want to run queries that are not possible with the query language (e.g. joins, or comparing different columns in the same table).

>>> from psycopg2.sql import SQL, Identifier
>>> cur = db._execute(SQL("SELECT label, name FROM gps_groups WHERE {0} = %s").format(Identifier("order")), [8])
>>> list(cur)
[('8.2', 'C2*C4'), ('8.5', 'C2^3'), ('8.1', 'C8'), ('8.3', 'D4'), ('8.4', 'Q8')]

Troubleshooting

If the connection to the SQL database is broken, you may need to manually reset it using db.reset_connection().

If a query is takes an unexpectedly long time, you can see what the underlying SQL command being executed is.

>>> db.nf_fields.analyze({"ramps":{"$contains":[3,5,11]}}, "label")
b'SELECT "class_group", "class_number", "cm", "coeffs", "conductor", "degree", "disc_abs", "disc_rad", "disc_sign", "embeddings_gen_imag", "embeddings_gen_real", "gal_is_abelian", "gal_is_cyclic", "gal_is_solvable", "galois_disc_exponents", "galois_label", "galt", "grd", "index", "inessentialp", "is_galois", "is_minimal_sibling", "iso_number", "label", "local_algs", "minimal_sibling", "monogenic", "num_ram", "r2", "ramps", "rd", "regulator", "relative_class_number", "subfield_mults", "subfields", "torsion_order", "used_grh" FROM "nf_fields" WHERE "ramps" @> ARRAY[ARRAY[3,5,11]]::numeric[] ORDER BY "degree", "disc_abs", "disc_sign", "iso_number" LIMIT 1000'
Limit  (cost=0.56..31787.91 rows=1000 width=627) (actual time=328.322..8175.915 rows=1000 loops=1)
  ->  Index Scan using nf_fields_degree_disc_abs_disc_sign_iso_number on nf_fields  (cost=0.56..10823431.84 rows=340495 width=627) (actual time=328.320..8175.348 rows=1000 loops=1)
        Filter: (ramps @> '{{3,5,11}}'::numeric[])
        Rows Removed by Filter: 287074
Planning Time: 54.861 ms
Execution Time: 8176.447 ms

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
lmf		lmf
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

lmfdb-lite

Install

Usage

Searching

Random objects

Statistics

Queries

Using SQL directly

Troubleshooting

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 3

Uh oh!

Languages

License

roed314/lmfdb-lite

Folders and files

Latest commit

History

Repository files navigation

lmfdb-lite

Install

Usage

Searching

Random objects

Statistics

Queries

Using SQL directly

Troubleshooting

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 3

Uh oh!

Languages

Packages