Find Closest Matching Document (NLP, Algorithm Etc..)

Hi,

We'd like to find someone who can either give us a proven Psaudo-code, or better off - send us a working code in C# that does the following:
1. The code gets a document (preferably HTML, but not a must).
2. The code finds the best matching document, from a list of pre-defined documents (possibly that the calculation had already ran through).
3. It returns a list of N documents ordered by best match first.

For example -
If I read this article on Engadget - http://www.engadget.com/2011/12/27/toshiba-thrive-7-review/

Which is about the Toshiba Thrive android tablet, then the "best match" documents would be the ones that are also talking about the same topic, and after them in order documents that talk about generic/other android tablets, then after that possibly documents about either tablets in general (iPad?) or Android in general and so on and so forth.

Possible places to look at are opencalais, td-idf, etc...

There are many algorithms out there, so we need something that will do the job well.

The test of the algorithm should be by taking ANY document, and finding the best matched from a pre defined list of other documents.

Once this is done, and we have an algorithm we can try, we will need to open another bid, to implement the algorithm on an existing C# based system (Like SharePoint) with many medical documents on it (but the algorithm must not use the fact that it's medical data, at least, not at first).

Search All Jobs:

Pcb Board Designing - Open To Bidding

i need to get a embedded pcb board designed. with good EMI outstanding.i need to get a embedded pcb board designed. with good EMI outstanding.

Skills Required: C++ Programming , Delphi , Electronics
  • Fixed Price Project
  • $250-$750
  • Job Expired

Assignment Report For Algorithm And Advanced Data Structure

All Pairs Shortest Paths Problem (APSPP) Consider a weighted complete graph G with vertex set G.V = {v0, v1, v2, ..., vn-1}. The weight of the edge from vi and vj is denoted as G.w(i, j). It is assumed that the weights of the edges are non-negative...

Skills Required: Algorithm , Mathematics
  • Fixed Price Project
  • $12-$150
  • Job Expired

Do Some Excel Work

I have filled an excel (.xls) spreadsheet with 80+ tabs and a bunch of individuals names that include URL hyperlinks. All of the hyperlinks are in the spreadsheet already. Please run a macro, extract the data required (see instructions), and combine it all into one tab...

  • Fixed Price Project
  • $10-$30
  • 3 days, 7 hours left

Do Some Work In Excel

I have filled an excel (.xls) spreadsheet with URL hyperlinks. All of the hyperlinks are in the spreadsheet already. Please run a macro, extract the data required (see instructions), and combine it all into one tab. Here are the directions: 1) Open .xls spreadsheet 2) The source tab in the ...

  • Fixed Price Project
  • $10-$30
  • 3 days, 7 hours left

Do Some Excel Work

I have filled an excel (.xls) spreadsheet with 80+ tabs and a bunch of individuals names that include URL hyperlinks. All of the hyperlinks are in the spreadsheet already. Please run a macro, extract the data required (see instructions), and combine it all into one tab...

  • Fixed Price Project
  • $10-$30
  • 3 days, 10 hours left

Check If A Web Site Is Working

Hello I want to create a desktop application that will check if a web site is down or not. In case it is down, it should send an email to admin

Skills Required: C++ Programming , PHP , Python
  • Fixed Price Project
  • $30-$250
  • 3 days, 14 hours left

Count And Label Objects In A Photo

The attached photo of a ceiling has thousands of small acoustic circles attached to the ceiling. I need someone to COUNT the number of acoustic circles AND label each one of them with a unique (small but legible) number so it is easy to verify the total count of circles.

  • Fixed Price Project
  • $10-$30
  • 4 days, 11 hours left

Time Series Predictive Model Using C# Language

I need to add on a time series predictive model feature on my current application, based on auto regressive model, using the Least Square Minimization Algorithm. It must be done using visual studio 2010, C# programming language...

  • Fixed Price Project
  • $30-$250
  • 4 days, 17 hours left

Time Series Predictive Model Using C# Language

I need to add on a time series predictive model feature on my current application, based on auto regressive model, using the Least Square Minimization Algorithm. It must be done using visual studio 2010, C# programming language...

  • Fixed Price Project
  • $30-$250
  • 4 days, 17 hours left

Build And Run Code Using Microsoft Visual Studio -- 2

I've installed Microsoft visual studio Express, but I've been unable to run the demos of the svdfeature toolkit (see http://jmlr.org/papers/volume13/chen12a/chen12a.pdf) on my PC (windows 8, 64 bit)...

  • Fixed Price Project
  • $15-$25
  • 5 days, 11 hours left

Job Details

  • Job Type
  • Fixed Price Job
  • Budget
  • $250-$750
  • Start Date
  • 12/28/2011 08:47:45 AM
  • Job Status
  • Job Expired