Genome Query 0.1 (deadline: 17 March)

  • Search algorithms:
    • (tick) KMP
    • (tick) Suffix arrays (Victor)
    • (lightbulb) K-mers (Andrew)
  • Annotations are used to get more precised location:
    • (tick) search in genes
    • (tick) protein id
    • (lightbulb) exons & introns (Kate)
  • UI:
    • (tick) on the fly error checking (Alexey K.)
    • (lightbulb) better layout for info/show/count/find results (Vlad O.)
    • (tick) improve help layout: see current right side help (Natalya K.)
    • (tick) show sample queries on startup page (Alexey K.)
  • Other:
    • (tick) ability to stop long running tasks (Vlad R.)
    • (lightbulb) preprocessing, sequence condensed structure (Vlad R.)

Backlog

Search algorithms:

  • Compressed Suffix arrays (Andrew)
  • Inexact search
  • (lightbulb) BLAST (Vlad O.)

Annotations are used to get more precised location: (Kate) in progress

  • repeats
  • locuses

Parser

  • (tick) inexact search syntax (Alexey A.)
  • (tick) regular expressions (Alexey A.)
  • (question) parsing on client side
  • (question) sytax for search by shift before/after gene

TODO:

  • Use issue tracker!
  • on the fly syntax check (question) client side with better error recovery (question)
  • task estimation time based on algorithm used
  • better instant input validation
  • think about result layout
  • HISTORY of queries















(red star) - Must have feature
(blue star) - Nice to have feature

(question) - Under Investigation
(lightbulb) - In Progress
(tick) - Feature Implemented