Warning: session_start(): open(/tmp/sess_fno4oo3sioerjslgl9bu1b7v20, O_RDWR) failed: No space left on device (28) in /www/H01/htdocs/lib/base/lib_base.php on line 280
Wikipedia Dump Reader openDesktop.org
-
 KDE-Apps.org Applications for the KDE-Desktop 
 GTK-Apps.org Applications using the GTK Toolkit 
 GnomeFiles.org Applications for GNOME 
 MeeGo-Central.org Applications for MeeGo 
 CLI-Apps.org Command Line Applications 
 Qt-Apps.org Free Qt Applications 
 Qt-Prop.org Proprietary Qt Applications 
 Maemo-Apps.org Applications for the Maemo Plattform 
 Java-Apps.org Free Java Applications 
 eyeOS-Apps.org Free eyeOS Applications 
 Wine-Apps.org Wine Applications 
 Server-Apps.org Server Applications 
 apps.ownCloud.com ownCloud Applications 
--
-
 KDE-Look.org Artwork for the KDE-Desktop 
 GNOME-Look.org Artwork for the GNOME-Desktop 
 Xfce-Look.org Artwork for the Xfce-Desktop 
 Box-Look.org Artwork for your Windowmanager 
 E17-Stuff.org Artwork for Enlightenment 
 Beryl-Themes.org Artwork for the Beryl Windowmanager 
 Compiz-Themes.org Artwork for the Compiz Windowmanager 
 EDE-Look.org Themes for your EDE Desktop 
--
-
 Debian-Art.org Stuff for Debian 
 Gentoo-Art.org Artwork for Gentoo Linux 
 SUSE-Art.org Artwork for openSUSE 
 Ubuntu-Art.org Artwork for Ubuntu 
 Kubuntu-Art.org Artwork for Kubuntu 
 LinuxMint-Art.org Artwork for Linux Mint 
 Arch-Stuff.org Art And Stuff for Arch Linux 
 Frugalware-Art.org Themes for Frugalware 
 Fedora-Art.org Artwork for Fedora Linux 
 Mandriva-Art.org Artwork for Mandriva Linux 
--
-
 KDE-Files.org Files for KDE Applications 
 OpenTemplate.org Documents for OpenOffice.org
 GIMPStuff.org Files for GIMP
 InkscapeStuff.org Files for Inkscape
 ScribusStuff.org Files for Scribus
 BlenderStuff.org Textures and Objects for Blender
 VLC-Addons.org Themes and Extensions for VLC
--
-
 KDE-Help.org Support for your KDE Desktop 
 GNOME-Help.org Support for your GNOME Desktop 
 Xfce-Help.org Support for your Xfce Desktop 
--
openDesktop.orgopenDesktop.org:   Applications   Artwork   Linux Distributions   Documents    Linux42.org    OpenSkillz.com   
 
Home
Apps
Artwork
News
Groups
Knowledge
Events
Forum
People
Jobs
Register
Login


-
- Content .- Fans (2) .- Knowledge Base  . 

Wikipedia Dump Reader

   0.2.10  

KDE Education

Score 86%
Wikipedia Dump Reader
zoom


Wikipedia Dump Reader
zoom


Wikipedia Dump Reader
zoom


Link:  http://
Downloads:  3537
Submitted:  Aug 30 2007
Updated:  Aug 16 2009

Description:

This simple programs display the text-only wikipedia compressed dumps, currently available at http://download.wikimedia.org/backup-index.html, generally named something like pages-articles.xml.bz2.

It's fairly useable now although lots of rendering issues occurs

Features includes a Qt viewer with basic text markup, following links, ability to read directly on the .bz2 compressed file (altough some index creations step is needed on first run), tab-like list of articles with load-in-the-background by default, a simple but useful keyword search, very light source-code, optional latex rendering.

The code requires PyQt4

Older versions has been tested on Fedora Core 4 and Kubuntu with PyQt4.1 (Python 2.4, Qt 4.2), and Ubuntu Gutsy.

See included README

Note that the development tree is now hosted on launchpad. See https://launchpad.net/wikipediadumpreader/

Any comment is welcome.




Changelog:

Updated to 0.2.10:
- Use a new indexing scheme for the entrylist - articles load faster now
- Upgrade path for old indexing scheme
- Utf8 fixes for non-ascii pathnames
- experimental RPM package - feedback welcome at the project website : https://launchpad.net/wikipediadumpreader

(jul 09: updated the ubuntu package for Jaunty's Python2.6 compatibility)

Updated to 0.2.9:
- make it able to load Wiktionary non-uppercased words
- Ability to load a 64-bits module - Thanks to Michael Heide
- added a small UI layout - Thanks to GreenReaper
- Better corrupted files handling

Updated to 0.2.8:
- Sorry : no program changes, but a much more friendly opening dialog
Built a rough Ubuntu package, to ease installation for unexperienced users running Ubuntu Gutsy or Hardy


Updated to 0.2.7:
- minor rendering fixes
- a few more macros

Updated to 0.2.6:
- better wikisyntax parsing
- minor bugfixes

Updated to 0.2.5:
- Bugfixes and improvement in rendering.
- Moved the development tree to lp
- optional fontsize

Updated to 0.2.4:
- Optional Latex/texvc call to render math. thanks to Mathieu Beliveau

Updated to 0.2.3:
- Fixed an obvious overflow bug in the index creation code.
Rebuilding the index is necessary, sorry. To force it, delete the two *idx files before running the program, and be patient (English dumps index creation takes several dozen minutes)
- basic table and footnotes support

Updated to 0.2.1 : fix a bug when reading articles on blocks boundaries
Updated to 0.2.2 : improved wiki rendering for lists and definitions




LicenseGPL
(Extractable Program (with source))
Ubuntu(Ubuntu debian package)
other(experimental, alien-converted RPM package)
Send to a friend
Subscribe
Other  Content  from benji2
Report inappropriate content



goto page:  1  2  3 

-

 Promising

 
 by jayenell on: Aug 30 2007
 
Score 50%

Looks very promising and also very clean. There are a few things I think should be fixed first.

1. Remove links to non existing articles (the red links)
2. Remove the interwiki links.
3. Remove links to categories or make sure the categories are parsed correctly.
4. Make sure when you click on a link you go directly to that article. Right now it is only added to the list on the right.
5. Make sure it depends on Python 2.5. 2.5 is default on Kubuntu and I had to install 2.4 also.

Keep up the good work,

J


Reply to this

-

 Re: Promising

 
 by benji2 on: Aug 30 2007
 
Score 50%

Hi !
Thanks for the comment
Those are interesting suggestion, i will have a look at it.

About 4., This is the intended behaviour, similar to "load tab in the background" in webbrowsers, which is a popular feature. You can get a direct-go behaviour by changing "self.loadTabInBackground = True" to False in the begining of dumpReader.py
Maybe i'll add an option for that in the GUI if needed.

About 5. The updated program should works on both Python 2.4 and 2.5. If not, please tell me the error you get. Some python module is included, maybe compiling it (as opposed to using the included precompiled .so) can help.


Reply to this

-

 lib

 
 by Superstoned on: Aug 31 2007
 
Score 50%

Pretty neath. BTW do you happen to know what happened to the integration library for wikipedia in KDE? There was supposed to be a library to make it easier to have wikipedia info in KDE apps, like currently in amarok and Marble and such... Maybe it's something for you ;-)


Reply to this

-

 Re: lib

 
 by Ekardnam on: Sep 11 2007
 
Score 50%

You (as in plural; I think superstoned already know about this ;) can read more about the Wikipedia and KDE cooperation here: http://meta.wikimedia.org/wiki/KDE_and_Wikipedia

It is indeed very nice, however, I don't know how it's proceeding either. :/


Reply to this

-

 great idea

 
 by REMF on: Sep 8 2007
 
Score 50%

i would love to see this released as an easy to use wikipedia offline .xml reader.

keep up the good work.


Reply to this

-

 keep going! (cheers)

 
 by REMF on: Sep 11 2007
 
Score 50%

i have been wanting an offline reader for ages, and finally it comes along.

thanks you.


Reply to this

-

 roadmap?

 
 by REMF on: Sep 21 2007
 
Score 50%

do you have a roadmap of where you want to go with this great app?


Reply to this

-

 Re: roadmap?

 
 by benji2 on: Sep 23 2007
 
Score 50%

Hi, thanks for your support.
My immediate goal is to fix one obvious overflow bug on the version 0.2.2 index creation code. Adding basic Table support should follow soon.
After that, i don't have much plans yet. I may work on improving wiki rendering or getting faster/smarter indexing. Also implementing the above mentionned suggestions about the category management sounds interesting.
Of course, i glady accept suggestions/feedback, both on the features or the UI


Reply to this

-

 Re: Re: roadmap?

 
 by REMF on: Sep 25 2007
 
Score 50%

thanks and good luck, i look forward to trying it soon.


Reply to this

-

 Great, but...

 
 by andrewmin on: Sep 24 2007
 
Score 50%

I love the idea, but I have one suggestion. What about using wget (or curl or whatever) to download the latest version from the repository? Then, you wouldn't have to redownload it manually.

Other than that, great job!


Reply to this

-

 Re: Great, but...

 
 by benji2 on: Oct 15 2007
 
Score 50%

Hi Thanks
Regarding your suggestion, it would be great - but it's indeed not possible to do, because there is no way to "update" the already downloaded dump. The only way to get more up-to-date wikipedia data is to delete the old dump (including indexes files) and fully download a new one.
Therefore, it's pointless to do that automatically. On the other hand, i'll add a few lines in the README explaining exactly that, so the user is not confused when he wants fresher data.


Reply to this

goto page:  1  2  3 

Add commentBack




-



 
 
 Who we are
Contact
More about us
Frequently Asked Questions
Register
Twitter
Blog
Explore
Apps
Artwork
Jobs
Knowledge
Events
People
Updates on identi.ca
Updates on Twitter
Content RSS   
Events RSS   

Participate
Groups
Forum
Add Content
Public API
About openDesktop.org
Legal Notice
Spreadshirt Shop
CafePress Shop
Advertising
Sponsor us
Report Abuse
 

Copyright 2007-2016 openDesktop.org Team  
All rights reserved. openDesktop.org is not liable for any content or goods on this site.
All contributors are responsible for the lawfulness of their uploads.
openDesktop is a trademark of the openDesktop.org Team