Item database project (for all these spreadsheets)
Welcome to Elitist Jerks
We're testing some new features on the site regarding OpenID registration and coordination with gamerDNA. If you experience any issues with registering an account, please take the time to fill out a report and send it to this e-mail address. We would appreciate any assistance you could provide in making sure everything is functioning as intended. Thanks!
If this is your first visit, please be sure to check out the FAQ and the forum rules. Users must register to post and new registrations are subject to a one day "mute" period to get acquainted with the community.
I would love to have a copy of the parser source! Also, I'm interested in how you put this in excel. When i save the code snippet you provided as an xml file and opened with Excel, I simply got a cell with the values entered as text.
ok, i will clean it up and post it here in a few days.
Is there a correct way to get this data into excel so that each attribute is busted out into its own column?
well i posted only a snipped of the file. Try this snippet, save it as test.xml and just open it in a recent excel version (edit: i tried it with Excel 2002 and OpenOffice 1.1, that didn't work, but Excel 2003 (XP?) did it correctly). It should fill the columns like you wanted. If you dont like this strange xml markup (dropdowns, blue border etc), select everything and "copy"-"paste contents" into a new sheet.
If someone finds a way to import data from a file that looks similar (otherwise I will try a simple .csv) into an older Excel version and/or OpenOffice, please let me know. It was just the first option I tried and it just worked...
okay, I modified the tool, so that it also exports a simple .csv file (that Excel 2002 can open, and it is alot smaller). I recommend to use the .csv-file for Excel-import (simply double-click the file).
To download the dot-net project with source, executable and the data pulled from allakhazam yesterday, go to to filefront here: http://hosted.filefront.com/stirius/
and download ExtractItem_v1.zip (about 1mb).
The database contains about 12k items but only with a few stats. The data.csv looks like this:
Name;ID;Slot;WearType;Armor;Intellect;Spirit;Stamina;SpellHit;SpellDmg;SpellFrost;SpellCrit;SpellPen;Heal;mp5;Sockets
68 TEST Green Cloth Head of;26132;Head;Cloth;108;;;;;;;;;;;
Abjurer's Hood of;9940;Head;Cloth;53;;;;;;;;;;;
Admiral's Hat;10030;Head;Cloth;51;;;;;;;;;;;
Adventurer's Bandana of;10261;Head;Leather;130;;;;;;;;;;;
Adventurer's Pith Helmet;9420;Head;Leather;94;;7;14;;;;;;;;
Alabaster Plate Helmet;8317;Head;Plate;419;;15;16;;;;;;;;
Amethyst Beholder Eye of;31220;Head;Cloth;120;;;;;;;;;;;
Ancient Crown of;15602;Head;Mail;202;;;;;;;;;;;
Ancient Terokkar Hood;31784;Head;Leather;188;;;21;;;;;;;;
Ango'rosh Helm of;24930;Head;Plate;638;;;;;;;;;;;
Ango'rosh Souleater's Cowl;25558;Head;Cloth;103;25;16;25;;28;;;;;;
Antlers of Malorne;29093;Head;Leather;271;25;19;22;;32;;20;;;;2
...
Even though you already worked around the older versions thing by going to CSV, here is some follow-up:
Excel 2003 is where XML really starts to enter, as far as I know. It's pretty well-supported in 2003, and 2007 takes it a step beyond that with actually having the native Excel file format be an XML package.
In 2003 (and 2007, anything prior, at least at my job, is really non-existent) you can actually create an Excel XML map of the file. What this does is if you tell it to create an XML map, and point it to your XML data, it will auto-generate a schema for you and create a table of items that represent the elements of your XML. You can literally then drag these elements onto the worksheet, and Excel will automatically generate pivot tables of all of the data. This allows you to do formatting, etc. etc. and have a baseline template to act from. Then, once the map is done, it's literally a matter of telling it to import an actual XML data file, and boom, all the fields you dragged over are auto-populated based on the XML data.
It's trivial in .NET to reference Excel as an object and do all of this automatically, spitting as an end-result an Excel file, formatted, and with all the data present from whatever XML file you point it to. You could also create Excel macros that will automate pulling XML data in even more easily.
I just wanted to illustrate this mapping approach, as it is much, much, much more powerful than just a simple XML import.
To download the dot-net project with source, executable and the data pulled from allakhazam yesterday, go to to filefront here: http://hosted.filefront.com/stirius/
and download ExtractItem_v1.zip (about 1mb).
post here, if you have any questions.
The csv seems to be seperate by semi-colons, not commas, I had to use the data import and specify semi-colon as the seperator for it to work for me.
Very awesome work though.
I loaded this data into excel, turned on autofilters, and was able to quickly make some farming lists.
Last edited by Nomad_Wanderer : 05/03/07 at 2:43 PM.
see the contained README.txt for configuring the parser, weartypes and stats (no compilation needed, only edit .txt files!).
Contained is the raw data from wow.allakhazam.com downloaded yesterday containing about 12k items.
Post here, if you somehow improved the parsing (i don't really understand the differents stats regarding Block, Blockvalue, Shieldblockvalue, Blockrating, etc.) or if you need help.
If you check over at Rogue Gear Spreadsheet you'll see we've been toying with the idea of creating a rogue database to do predictions and gear tests. Getting access to the bliz-formatted data from the armory is a great help.
I appreciate the code very much, but i get an error when trying to use it. i'm using Win2k (with updated 1.1 .Net framework). Here's a screenshot of the error:
Also, if it helps, I've attached my stats.txt file (i didn't change any of the others)
I appreciate the code very much, but i get an error when trying to use it. i'm using Win2k (with updated 1.1 .Net framework). Here's a screenshot of the error:
Also, if it helps, I've attached my stats.txt file (i didn't change any of the others)
alright, I forgot to test that again before uploading. The solution ist pretty simple:
Insert a newline before the line with the stats in the stats.txt.
Incidentally, I've been using the armory as well and I've noticed it leaves a lot to be desired: various items are not loadable from the armory on an apparently random basis. As I mentioned in another thread, I think this is probably a data-mining protection mechanism much like the servers have to prevent data mining with item links.
Hrm, does anyone have a working parser? This other thread [WoWEquip/Item Database doesn't seem to have something I can use, so for now I'm stuck manually typing item stats into Excel.
based on initial tests, I think I can have a full armory-sourced set of files with each blizzard item. My questions to you are:
would having this data in a local file format be useful to you?
If so, what file type would be most accessible?
- xml (by far the easiest for me)
- Microsoft Access
- Any format Access can export to (including tab delmited text)
XML.
Also, if someone wanted to host an xml-rpc/soap service, that would be nice too. :-)
Thott/allakhazam/armory/whatever are fine, but without being able to mine that information without screen-scraping, it sucks. (*really* wish armory had a soap interface)
+ what are the max number of on-equip / on-use features per item?
- for example, kiss of the spider has 1 on use (haste rating) and 2 on equip spells (hit rating, crit rating). can an item have 3 of one and 0 of another? can an item have more than 3 combined?
Just off the top of my head,
14:11:24 called in wowhead_item::start:324 Item not found!
I actually picked this up the other day as something to do while on business travel. Didn't realize there were any further developments since the original post and hadn't seen anyone else's effort til I was ready to post.
Took a quickish but very dirty attempt in python.
run it in the directory with all your XML tooltip files, and you'll get an output.csv file that has each unique field as a column header.
Several sacrifices were made for simplicity, but if I was still maintaining my hunterlist, I'd be able to easily use the output in conjuction with my spreadsheet's sort/filter tools.
rename to .py extension
edit: as far as I can tell, this depends on having something similar to the individual .xml file per tooltip that disquette originally provided
Last edited by Lurchington : 08/09/07 at 12:46 AM.