Open Source Projects for Web Search Engine Components [closed] - open-source

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 7 years ago.
Improve this question
Can someone put a list of open source software of each of these components included in the question "Building a web search engine"?
That will save many people's time in the search for high quality open source components.
Component List:
Crawler
Parser
Indexer
Ranking Engine
Front End

For the Java world:
Parsers
JavaCC
Antlr
Indexer
Apache Lucene
Ranking Engine
Apache Lucene
Front End
whatever lightweight MVC framework

Lucene is also available for .NET http://incubator.apache.org/lucene.net/

It looks like nutch does everything you need out of the box.

Nutch + Lucene + Mapreduce/Apache is your solution.

Related

A Good JSON Database Library? [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 6 years ago.
Improve this question
I am using Node.js...
Personally, I'm not a fan of SQL. What I do like is JSON, and I would like to be able to store my server data as JSON.
Even though it's possible to just JSON.parse and .stringify the process, I get the impression that doing this is a bad idea for large-scale applications.
Also, it would be nice if the database library could be just a library, and not a server like with MySQL.
Any suggestions on where to find said library would be much appreciated.
Availability of said library on NPM would be brilliant.
I use low db befor for small nodejs projects. Its easy to use. See documentation.
Have fun.
https://www.npmjs.com/package/lowdb

web application using tcl/tk [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 8 years ago.
Improve this question
I want to develop a simple web based resource reservation system using TCL/TK.
Please let me know some good tutorials or links to start with.
Also some good examples please.
Thanks,
Ramya
OpenACS has a room reservation module included. OpenACS is written in TCL. There are tutorials available here.
Note that Tk is a library to program GUIs; it's not a part of Tcl which is a language (and its reference implementation).
But there is a project which provides for programming web presentation layer in a way resembling programming GUIs with Tk—it's called Æjaks.

Neo4j Documentation [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 5 years ago.
Improve this question
I've been looking into setting up and trying out Neo4j on Amazon EC2 however I seem to have hit a road block with finding documentation that I can use.
For example, this page mentions "Clustering, High Availability and Online Backup are add-on components that manage distribution and consistency over several live or offline node spaces." However, I haven't been able to find any more information on these add-ons.
http://wiki.neo4j.org/content/Neo4j_in_the_Cloud
This seemingly key wiki page describing how to set up Neo4j seems to be missing some very information.
http://wiki.neo4j.org/content/Neo4j_Setup_HowTo
Is anyone else using this and referencing better documentation, or am I completely missing something?
You can find some related info at this link, although the setup might depend a bit your requirements. The HA support is currently in beta, we will be able to let you test it as soon as Neo4j 1.0 is out of the door, which is before end of Jan 2010.

Scan Source Code for Licenses Used [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 7 years ago.
Improve this question
I remember encountering a commercial software that scanned a project's source code and provided a list of all the different software licenses it used. That is, it would find all the third party code used in a project and give their licenses (Apache, GPL, ...).
Can anyone provide a link to such a program?
Found the one I remembered:
http://www.blackducksoftware.com/protex - Protex by Black Duck.
The key term to use in google, as I learned the hard way, is "Software Compliance Management".
FOSSology is a GPL-licensed tool for analyzing OS licensing. It's main capability is to do pattern matching against uploaded source code and find matching licenses.
I know about a Ruby library called Ohcount.

Are there any open-source code-generation projects out there? [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 7 years ago.
Improve this question
I'm using an in-house code generator in which we supply all the metadata regarding our objects, and it generates our database, and object model including validation logic. Once we load all the data in the database, we use classic ASP <% %> to build our templates. I would like to know what open-source frameworks similar to this that are out there.
Thanks!
You can also check openArchitectureWare (http://www.openarchitectureware.org/) which is Eclipse-based.
Have you looked at mygeneration? It's not exactly open source but it's free, and the scripts are totally open. The most common usage is to generate the data layer given a database schema, but you can be more creative than that.
There's also Cog, which allows you to run Python scripts inside of source files. The included Cog module allows printing into the file that it's currently working on. Cog is licensed under MIT.
Check out Star UML it's open source and has some decent code generation capabilities.
I'd also look into Eclipse Plug-in options, I'm sure they're available I just haven't worked with any.
MyGeneration Code Generator is now Open Source.