use multiplex regex to implement routing #5

jackxxu · 2014-08-27T15:13:15Z

multiplexing newark routing

The objectives of this PR are to make routing in Newark framework flexible and fast.

the advantages

use multiplex routes (combining all the route regex to create a jumbo regex) and match one time instead of match individually. the following benchmark shows that in that particular example, routing times drops to almost 10% of the current approach (of course, your milage may vary). see below for benchmark results.
more readable routes: currently, in order to to match something like /people/1.xml and enforce the xml extension, we will need to use a nerdy regex like get \/people\/(?<id>\S*).xml$/ {...}. this change allows a much readable syntax such as get '/people/:id.xml/ {...}. It even allows syntax like get '/people/:id.:format/ {...} (see the 2 new tests test/test_router.rb)

N.B. the code can obviously be refactored, but I'd leave it for a future update.

Survey of routing algorithm in ruby web frameworks

many approaches are used in

our current approach that loop through all route's regex and match. some of the gems that use it include:
- journey gem (https://github.com/rails/journey), and thus Rails.
- sinatra gem (https://github.com/sinatra/sinatra)
the disadvantage of this approach is its O(n) complexity, especially a large amount of routes are matched.
multiplexing approach. this approach takes advantage of the ruby regular expression's union and named capture functionality to create a multiplex route and match only once. It additionally benefits from the better performance of === over match method. it has O(1) complexity.
- rack-multiplexer gem (https://github.com/r7kamura/rack-multiplexer)

Benchmark results

with mri ruby 2.1.1, on MacPro,

iterations = 100000

Benchmark.bmbm do |x|
  x.report('linear approach') do
    iterations.times do
      request_paths.each do |path|
        routes_regex.find {|regex| regex.match(path) }
      end
    end
  end

  x.report('multiplex approach') do
    iterations.times do
      request_paths.each do |path|
        jregex === path
      end
    end
  end
end

__END__

Rehearsal ------------------------------------------------------
linear approach     10.410000   0.010000  10.420000 ( 10.418204)
multiplex approach   1.480000   0.000000   1.480000 (  1.479806)
-------------------------------------------- total: 11.900000sec

                         user     system      total        real
linear approach     10.290000   0.010000  10.300000 ( 10.303956)
multiplex approach   1.500000   0.000000   1.500000 (  1.497642)
``

jackxxu added 2 commits August 27, 2014 09:30

use multiplex regex to implement routing

2e46f5a

add wildcard matching example

d62b244

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

use multiplex regex to implement routing #5

use multiplex regex to implement routing #5

Uh oh!

jackxxu commented Aug 27, 2014

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

use multiplex regex to implement routing #5

Are you sure you want to change the base?

use multiplex regex to implement routing #5

Uh oh!

Conversation

jackxxu commented Aug 27, 2014

multiplexing newark routing

the advantages

Survey of routing algorithm in ruby web frameworks

Benchmark results

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant