Repository: pdfbox-docs
Updated Branches:
  refs/heads/master 8f22f405f -> 460b2396a

remove dead links, add semanticscholar.org

Project: http://git-wip-us.apache.org/repos/asf/pdfbox-docs/repo
Commit: http://git-wip-us.apache.org/repos/asf/pdfbox-docs/commit/460b2396
Tree: http://git-wip-us.apache.org/repos/asf/pdfbox-docs/tree/460b2396
Diff: http://git-wip-us.apache.org/repos/asf/pdfbox-docs/diff/460b2396

Branch: refs/heads/master
Commit: 460b2396ab7b62ffe50dd79ec30a005992515cd8
Parents: 8f22f40
Author: John Hewson <j...@jahewson.com>
Authored: Tue Jan 26 11:38:18 2016 -0800
Committer: John Hewson <j...@jahewson.com>
Committed: Tue Jan 26 11:38:18 2016 -0800

----------------------------------------------------------------------
 content/references.md | 22 ++++++++--------------
 1 file changed, 8 insertions(+), 14 deletions(-)
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/pdfbox-docs/blob/460b2396/content/references.md
----------------------------------------------------------------------
diff --git a/content/references.md b/content/references.md
index b2ff0ce..287cf54 100644
--- a/content/references.md
+++ b/content/references.md
@@ -25,35 +25,29 @@ title: External Links
 This page lists projects that utilize PDFBox and articles that have been written about PDFBox. Please file an [improvement issue](https://issues.apache.org/jira/browse/PDFBOX) to get new projects or articles added to this page, or to update the information on existing links.

-## Projects
+## Projects using PDFBox

 | Project Name | License | Project Description |
 | --- | --- | --- |
 | [Alfresco](http://www.alfresco.org/) | LGPL - commercial services/support/training is available | Alfresco is an open source, open-standards content repository built by the most experienced content management team that includes the co-founder of Documentum.|
-| [Apache Nutch](http://nutch.apache.org/) | Apache License V2.0 | Apache Nutch is open source web-search software. It builds on Apache Lucene, adding web-specifics, such as a crawler, a link-graph database, parsers for HTML and other document formats, etc.|
-| [Apache Tika](http://tika.apache.org/) | Apache License V2.0 | Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.|
-| [Centric CRM](http://www.centriccrm.com/) | Free To Use But Restricted/Commercial | The Most Advanced Open Source CRM Software.|
+| [Apache Nutch](http://nutch.apache.org/) | Apache License v2 | Apache Nutch is open source web-search software. It builds on Apache Lucene, adding web-specifics, such as a crawler, a link-graph database, parsers for HTML and other document formats, etc.|
+| [Apache Tika](http://tika.apache.org/) | Apache License v2 | Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.|
 | [Canoo Webtest](http://webtest.canoo.com/webtest/manual/WebTestHome.html) | BSD Like | Free OpenSource tool for XP-style acceptance testing of Java-based Web applications.|
-| [contineo](http://webtest.canoo.com/webtest/manual/WebTestHome.html) | GPL | Contineo is a web based document management system.|
 | [ECM REWOO Scope](http://www.rewoo.de/) | Commercial | REWOO Scope is an Enterprise Content Management (ECM) software to organize, structure and consolidate enterprise data. Apache PDFBox is an integral part to read and index PDF documents.|
-| [Jahia](http://www.jahia.org/) | collaborative source license | The Jahia product is currently the most powerful, ready-to-use and affordable integrated midrange Java Content Management and Corporate Portal Server.|
-| [jLibrary](http://jlibrary.sourceforge.net/) | BSD | jLibrary is a Document Management System, oriented for personal and enterprise use.|
 | [Jomic](http://jomic.sourceforge.net/) | GPL | Jomic is a viewer for comic book archives.|
-| [JpdfUnit](http://jpdfunit.sourceforge.net/) | Apache License V2.0 | pdfUnit is a framework for testing a generated pdf document with the JUnit Test Framework.|
+| [JpdfUnit](http://jpdfunit.sourceforge.net/) | Apache License v2 | pdfUnit is a framework for testing a generated pdf document with the JUnit Test Framework.|
 | [Liferay Portal](http://www.liferay.com/) | MIT | Liferay Portal is an open source portal that helps organizations collaborate more efficiently by providing a consolidated view of disparate applications.|
-| [LIUS](http://www.bibl.ulaval.ca/lius/index.en.html) | GPL | LIUS is an indexing Java framework based on the Jakarta Lucene project. The LIUS framework adds to Lucene many files format indexing fonctionalities as: Ms World, Ms Excel, Ms PowerPoint, RTF, PDF, XML, HTML, TXT, Open Office suite and JavaBeans.|
 | [LuceGene](http://gmod.org/wiki/LuceGene) | Artistic License | LuceGene is an open-source document/object search and retrieval system specially tuned for bioinformatics text databases and documents.|
 | [Lutece](http://www.lutece.paris.fr/) | BSD-like | Lutece is a portal engine which allows you to easily create your websites or intranets based upon HTML,XML content.|
 | [MMBase Lucene Module](http://mmapps.sourceforge.net/lucenemodule/) | MPL | Lucenemodule is a plugin (module) for the MMBase content management system that enables Lucene full text search through it's content, and thanks to PDFBox also PDF content.|
-| [OpenCms](http://www.opencms.org/) | Custom | OpenCms is a professional level Open Source Website Content Management System.|
+| [OpenCms](http://www.opencms.org/) | LGPL | OpenCms is a professional level Open Source Website Content Management System.|
 | [OpenSearchServer](http://www.open-search-server.com/) | GPLv3 | An open source search engine and crawler based on best open source technologies. It is a modern search engine and a suite of high-powered full text search algorithms.|
 | [Orbeon PresentationServer](http://forge.objectweb.org/projects/ops) | LGPL | Orbeon PresentationServer (OPS) is an open source J2EE-based platform for XML-centric web applications. OPS is built around XHTML, XForms, XSLT, XML pipelines, and Web Services, which makes it ideal for applications that capture, process and present XML data. Commercial consulting/training/support is available through orbeon.|
-| [PDFcat](http://pdfcat.sourceforge.net/) | LGPL | PDFcat is multi-platform catalog manager that provides searching capability over documents among virtual catalogs.|
 | [SearchBlox](http://www.searchblox.com/) | Commercial | SearchBlox is a high-performance corporate search software designed for the Java 2 Enterprise Edition (J2EE) platform.|
-| [SimplexRepaginator](http://www.simplexrepaginator.com/) | Apache License V2.0 | Simplex Repaginator converts simplex-scanned PDFs into properly duplex-paginated PDFs and vice versa. |
+| [Semantic Scholar](https://www.semanticscholar.org)| Web Based | Semantic Scholar is a new service from AI2 for scientific literature search and discovery, focusing on semantics and textual understanding.
+| [SimplexRepaginator](http://www.simplexrepaginator.com/) | Apache License v2 | Simplex Repaginator converts simplex-scanned PDFs into properly duplex-paginated PDFs and vice versa. |
 | [Terrier](http://ir.dcs.gla.ac.uk/terrier/) | MPL | Terrier is software for the rapid development of Web, intranet and desktop search engines.|
-| [Triboni GinkGO](http://www.triboni.com/) | Commercial | Triboni GinkGO is a highly scalable J2EE services platform that is based on a simple XML business object defintion and scripting language. Toghether with XSLT content centric web applications can be configured in a very short time.|
-| [Zilverline](http://www.zilverline.org/) | Collaborative Source License | Zilverline is a search engine that offers web access to your personal or intranet content.|
+| [Triboni GinkGO](http://www.triboni.com/triboni/exec/x/int.triboni.website.display/xsl/display/name/Default/chapter/ginkgo/language/en) | Commercial | Triboni GinkGO is a highly scalable J2EE services platform that is based on a simple XML business object defintion and scripting language. Toghether with XSLT content centric web applications can be configured in a very short time.|

 ## Articles/Books