I have had similar problems with the size of DBpedia. A simple solution is
to find the downloads that you are interested and filter out the
triples using grep. If I am interested in the article category
"Butterflies" and I suspect there are useful triples in the
article_categories
download I can use the following bash script and get a smaller set of
triples that contain the references to the URI of interest.
Hope this helps - Pete
#!/bin/bash
grep "<http://dbpedia.org/resource/Category:Butterflies"
article_categories_en.nt > dbpedia_nt/categories/butterflies.nt
On Fri, Dec 23, 2011 at 4:21 PM, Ravindra Harige <[email protected]>wrote:
> Hello everyone,
>
> I would like to use the DBpedia datasets for my first semantic-web
> project. So please help me with the following beginner question:
>
> I'm working with Jena framework and I have a long list cities across the
> world ~150 in my Joseki store. Now I want to fetch all the information
> related to these cities like, monuments, parks, stadiums, universities, etc
> that are in each one of these cities from DBpedia and store them into
> Joseki.
>
> I do not want to download the huge dataset dumps available on DBpedia site
> as I need data only about cities. So I would like to know how to achieve
> fetch only the required data programatically.
>
> Thanks.
>
>
> ------------------------------------------------------------------------------
> Write once. Port to many.
> Get the SDK and tools to simplify cross-platform app development. Create
> new or port existing apps to sell to consumers worldwide. Explore the
> Intel AppUpSM program developer opportunity. appdeveloper.intel.com/join
> http://p.sf.net/sfu/intel-appdev
> _______________________________________________
> Dbpedia-discussion mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>
>
--
------------------------------------------------------------------------------------
Pete DeVries
Department of Entomology
University of Wisconsin - Madison
445 Russell Laboratories
1630 Linden Drive
Madison, WI 53706
Email: [email protected]
TaxonConcept <http://www.taxonconcept.org/> &
GeoSpecies<http://about.geospecies.org/> Knowledge
Bases
A Semantic Web, Linked Open Data <http://linkeddata.org/> Project
--------------------------------------------------------------------------------------
------------------------------------------------------------------------------
Ridiculously easy VDI. With Citrix VDI-in-a-Box, you don't need a complex
infrastructure or vast IT resources to deliver seamless, secure access to
virtual desktops. With this all-in-one solution, easily deploy virtual
desktops for less than the cost of PCs and save 60% on VDI infrastructure
costs. Try it free! http://p.sf.net/sfu/Citrix-VDIinabox
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion