Host outside of IIS #22

vors · 2015-12-08T20:57:42Z

I tried host generated website with a simple python webserver

python -m http.server 8000

Search doesn't work in this case.

The text was updated successfully, but these errors were encountered:

KirillOsenkov · 2015-12-08T22:26:22Z

This is by design. The search part was written with ASP.NET Web API and would require a complete rewrite to support other technologies.

atifaziz · 2015-12-08T22:43:44Z

@KirillOsenkov What is that requires any server-side processing? Could one not just generate a static web site (that could then be hosted on any web server/site, including GitHub pages) & use JavaScript to implement client-side search? Is there anything architecturally preventing that?

KirillOsenkov · 2015-12-09T01:09:57Z

The only thing that's on the server is a list of declarations:
http://sourcebrowser.azurewebsites.net/#Microsoft.SourceBrowser.SourceIndexServer/Models/IndexLoader.cs,901622aa75abf138

and a few auxiliary data structures (list of all assemblies and all projects).

But primarily it's a list of all declared symbols:
http://sourcebrowser.azurewebsites.net/#Microsoft.SourceBrowser.SourceIndexServer/Models/IndexLoader.cs,e7decef3080779f5

Each declared symbol is basically a type with 5 things:
http://sourcebrowser.azurewebsites.net/#Microsoft.SourceBrowser.SourceIndexServer/Models/IndexEntry.cs,671f1c8b3a24aa51

Assembly number, glyph (icon), name (what you search for), symbol ID (the hex number used in hyperlinks to it) and the description (usually the full namespace and type and member name).

You can store this static list on the server as .txt, download it in JavaScript and implement search on the client without even going to the server.

The problem with that approach is SourceBrowser was designed to be highly scalable. It easily works with 60 million lines of code (all of Microsoft Developer Division source) and can scale to 100 million easily. This means around 6 million symbols currently (4 GB memory compressed). This is not something you can do on the client. Holding this list on the server is easy.

Implementing the feature we're talking about would basically mean removing SourceIndexServer and replacing it with a client side solution. This is a non-trivial amount of work and not something I'm willing to do (no time). I also don't believe it will scale past 100-200 thousand symbols, which would cover most midsize codebases but not the bigger ones.

If someone is willing to do the work, feel free to do so in your own forks, but I won't be taking this as a PR into my original repo (lots of support work I'm not prepared to do).

Thanks!

vors · 2015-12-09T20:51:29Z

Thank you for the explanation!
I think the description is a little bit misleading:

Create and host your own static HTML website

static HTML gives the impression that there is no need in running it in IIS.
Maybe it's worth to elaborate this part about server-side search in the project description.

KirillOsenkov · 2015-12-09T21:54:11Z

You're right. I'll update the description. Thanks!

atifaziz · 2015-12-10T07:21:55Z

Thanks for a summary of the design & the background on why the server part may be necessary. I had a hunch it might be about the scalability of the index/source/search when I browsed through the source on its initial publication. I'm guessing that the unmanaged memory allocations are also to work around GC pressure and limits, especially if you're going to load 4GB of compressed memory?

If someone is willing to do the work, feel free to do so in your own forks, but I won't be taking this as a PR into my original repo (lots of support work I'm not prepared to do).

Understood. However, it may help to depict the architecture & provide background on some of the design decisions (just as you did above) in a wiki so someone stands a good chance of doing that (or even assessing the feasibility of doing it) without first getting their head wrapped around the entire server code base.

KirillOsenkov · 2015-12-10T07:25:11Z

Yes, exactly. Using tricks like ushort (for tighter struct packing), allocating in native memory (to avoid 24 bytes per object overhead for 64-bit processes), compressing descriptions are all tricks that helped reduce the memory consumption on the server by a few gigabytes.

Good idea about the Wiki. I will add it.

KirillOsenkov · 2015-12-10T07:33:04Z

OK here you go: https://github.com/KirillOsenkov/SourceBrowser/wiki/Why-does-generated-Html-still-require-a-server-side-ASP.NET-Web-API%3F

atifaziz · 2015-12-11T15:17:50Z

@KirillOsenkov That's a start 👍 and thanks.

KirillOsenkov closed this as completed Dec 8, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Host outside of IIS #22

Host outside of IIS #22

vors commented Dec 8, 2015

KirillOsenkov commented Dec 8, 2015

atifaziz commented Dec 8, 2015

KirillOsenkov commented Dec 9, 2015

vors commented Dec 9, 2015

KirillOsenkov commented Dec 9, 2015

atifaziz commented Dec 10, 2015

KirillOsenkov commented Dec 10, 2015

KirillOsenkov commented Dec 10, 2015

atifaziz commented Dec 11, 2015

Host outside of IIS #22

Host outside of IIS #22

Comments

vors commented Dec 8, 2015

KirillOsenkov commented Dec 8, 2015

atifaziz commented Dec 8, 2015

KirillOsenkov commented Dec 9, 2015

vors commented Dec 9, 2015

KirillOsenkov commented Dec 9, 2015

atifaziz commented Dec 10, 2015

KirillOsenkov commented Dec 10, 2015

KirillOsenkov commented Dec 10, 2015

atifaziz commented Dec 11, 2015