Submission deadline: July 31, 2025 11:59 PM ET
Max Quasi-Clique Challenge
Challenge Description

Our goal in this challenge is to find large quasi-cliques in the undirected graph representing the FlyWire connectome. A subgraph with k vertices is a quasi-clique if it contains more than (k choose 2) / 2 edges - or in other words, if more than half of all possible edges exist between its vertices. The participant who finds the largest quasi-clique first will win.

Input and objective

You are given a simple graph (unweighted undirected) G = (V, E), where:

  • nodes V represent individual neurons
  • edges E represent synaptic connections between connected neurons

You need to find a subset of k vertices from V that contains strictly more than (k choose 2) / 2 edges. And your goal is to maximize k.

Relation to Motif Search

This problem is loosely related to motif search, a fundamental challenge in connectome network analysis. Existing motif search tools are optimized for finding small motifs, often struggling to scale efficiently to larger patterns. We hope that by optimizing algorithms for large dense subgraph detection, the techniques developed in this challenge may contribute to more scalable and practical motif search tools in the future.

Prerequisites

This is a data analysis / optimization challenge, and it is open to everyone. Participants do not have to know anything about the Drosophila brain or connectomics. Data processing and some algorithmic skills should suffice to compete.

Data
  • Simple graph underlying the FlyWire connectome (download here)
  • Benchmark solution (download here)
Here is a sample code in Python for verifying submissions:

# load data
edges = {(r[0], r[1]) for r in read_csv("edges.csv.gz")}
nodes = sorted([r[0] for r in read_csv("max_clique_submission_benchmark_162.csv.gz")])
# verify no duplicate nodes
assert len(nodes) == len(set(nodes)), "Duplicate nodes detected"
# verify that edge density is above 0.5
edge_count, pair_count = 0, 0
for i1, n1 in enumerate(nodes):
  for n2 in nodes[i + 1:]:
    pair_count += 1
    if (n1, n2) in edges:
      edge_count += 1
assert edge_count / pair_count > 0.5, "Insufficient density"

How to submit
  • Email your solution to arie@princeton.edu, including:
    • Size k of the subgraph found
    • CSV / text file with k rows / lines, each containing a neuron id corresponding to one vertex in the found quasi-clique (like in the benchmark solution)
    • Short explanation of how the result was achieved (method / main idea)
    • Kindly introduce yourself and share how you came to learn about the challenge
    • Mention if it is ok to have your name displayed on the leaderboard
  • Submissions will be verified and leaderboard updated periodically, until Jul 31, 2025 end of day (ET).
  • There is no limit on number of submissions per participant / team. Only the best one will be shown.
  • Winners of the challenge will receive a special plaque from the FlyWire team at Princeton University, and will be invited to give a short presentation (optional).
Notes
  • Original challenge (posted on March 7th) was about finding max cliques (complete subgraphs). As it turned out, it was not too difficult for the given connectome graph. The following participants all found maximal cliques of size 40 shortly after the challenge was posted: Wayne B Hayes, Justin Ellis-Joyce, Thiago Munhoz da Nóbrega, David A. Bader, Harinarayan Asoori Sriram, Srijith Chinthalapudi, Zhihui Du. In particular, Justin and Thiago enumerated all 252 maximal cliques of size 40. On March 9 the challenge was modified from finding cliques to finding quasi-cliques.
  • The edge list contains sorted pairs of nodes only. But since the underlying graph is undirected, if for example (x,y) is the edge list, then (y,x) is also an edge even though it is omitted from the list for compactness.
  • To not disadvantage early submissions, scores and solutions will be revealed after the challenge is complete.
  • Only the best score is recorded on the leaderboard for each participating team.
Leaderboard
Placement is determined by the size of the found quasi-clique, with ties broken by submission date.
Scores and solutions will be revealed after the challenge ends.
Date Name Subgraph Size Edge Density (# edges / # node pairs)
2025-03-10 Pedro José Correia Duarte, Christian Larsen, Rick Willemsen
current leader
- -
2025-03-13 David A. Bader, Harinarayan Asoori Sriram, Srijith Chinthalapudi, Zhihui Du - -
2025-03-15 Thiago Munhoz da Nóbrega - -
2025-03-17 Leo Versteegen - -
2025-03-20 Xin Zheng, Zuoyu Zhang, and Tatsuo Okubo - -
2025-03-10 Wayne B Hayes - -
2025-03-09 Justin Ellis-Joyce - -
2025-03-08 Benchmark 162 0.5013419216317767