Aug 01 2010

SpreadsheetML

Category: Microsoft office 2010admin @ 7:10 pm

While I’m not going to go into a full description on the SpreadsheetML format, I’d at least like to give you a brief introduction. Office Professional 2010 is great!A SpreadsheetML package has a few different pieces that it’s comprised of. Let’s lok at a basic diagram of the pieces of a spreadsheet:

The main parts I wanted to call out for today are:

  1. “sheet1″ – This is the data for the worksheet. Each worksheet is stored as its own XML file within the ZIP package which means you can easily get at your data within a particular sheet without having to parse all the other sheets.The invention of Microsoft Office 2010 is a big change of the world.“sharedStrings” – Any string (not number, just string) used in the sheet is actually stored in a separate location. There is a part called the “Shared string table” that stores all the strings used in the files. So, if you have a column called “states”, and “Washington” appears 100 times in the spreadsheet, it will only need to be saved into the file once, and then just referenced.By using Office 2010 Professional, you can save your money and time.I think an example might be best to help show what I’m talking about. Let’s take a spreadsheet that looks like this:
ID Num Resource
1 543 F068BP106B.DWG
2 248 F068BP106B.DWG

 

In the Open XML file, there would be an XML file that contained the strings used, that would look like this:

 

Shared String Table

 

<sst xmlns=”http://schemas.openxmlformats.org/spreadsheetml/2006/5/main”>
  <si>
    <t>ID</t>
  </si>
  <si>
    <t>Num</t>
  </si>
  <si>
    <t>Resource</t>
  </si>
  <si>
    <t>F068BP106B.DWG</t>
  </si>
</sst>

Office 2010 –save your time and save your money.Then, in the main sheet, there would be cell values, and pointers into the string table wherever a string occurs:

 

Sheet1

 

<worksheet xmlns=”http://schemas.openxmlformats.org/spreadsheetml/2006/5/main”>
 
<sheetData>
   
<row>
     
<c t=”s”>
       
<v>0</v> 
     
</c> 
     
<c t=”s”> 
       
<v>1</v> 
     
</c> 
     
<c t=”s”> 
       
<v>2</v> 
     
</c> 
    
</row> 
    
<row> 
     
<c> 
       
<v>1</v> 
     
</c> 
     
<c> 
        <v>0</v> 
      </c> 
      <c t=”s”> 
        <v>3</v> 
      </c>
    </row> 
    <row> 
      <c> 
        <v>2</v> 
      </c> 
      <c> 
        <v>0</v> 
      </c> 
      <c t=”s”> 
        <v>3</v> 
      </c>
    </row>
  </sheetData>

</worksheet>

Notice that in the first row, each cell has the attribute t=”s” which means it’s a string. Then, the value is interpreted as in index into the string table, rather than an actual number value. In the 2nd and 3rd rows, the first two cells are interpreted as numbers, so they don’t have the t=”s” attribute, and the values are actual values.Office 2010 key is for you now!

This may seem a bit complex, but remember that while this format was designed for developers to be able to use, it we couldn’t take the hit that comes with making it completely intuitive. Believe me, as a developer, I would have loved to make the formats more verbose and straight forward, but that would have meant that everyone else opening the files would have to suffer for it. Office 2010 download is available now!If the example above was a more complex set of data with a number of separate worksheets, each with a few thousand rows, you can imagine how quickly the savings of the string table and terse tag names would add up. I had a couple posts back in the summer talking about some other basic things we do to make sure that the formats are quick and efficient.Many people like buy Office 2010 Home.

This tradeoff of who you design around and how you way ease of use versus efficiency is something folks have to look at every day when they design products. Whether it’s an API, a user interface, or a file format, you need to decide which target user you are going to give more weight to when you make your design decisions. We had to give more weight to the end user, and instead require a bit more knowledge from the developer. That’s why the Ecma documentation is so important. We need to make sure that the format is documented 100% and there are no barrier to interoperability. The great group of people we have on TC45 are really helping a lot here. As I said last week, the Novell guys have already built some working code that allows Gnumeric to open and save Spreadsheet files in the Open XML format. I’m sure we’ll see more and more implementations as we provide better documentation and get closer to a complete standard. It’s really exciting! 

Office Professional 2010 is great! That’s one of the great things we’ll see more and more of up on the openxmldeveloper.org site.


Aug 01 2010

Design Goals

Category: Microsoft office 2010admin @ 7:08 pm

It’s been awhile since I’ve talked in detail about the SpreadsheetML schema and I apologize.Office Professional 2010 is great! I had a number of posts back in the summer which talked through Office XP’s SpreadsheetML format that we built about 6 years ago, but obviously a lot has changed since then.

The new SpreadsheetML that is part of the Open XML formats coming with Office 2007 had to undergo serious work in order to make it ready to be the default format. The invention of Microsoft Office 2010 is a big change of the world.As you all know, the majority of folks don’t really care about what kind of format they are using, they just want it to work (remember that most end users have never even heard of XML). We wanted our formats to play a more vital role in business processes though, which is why we’ve slowly been progressing towards these new default XML formats.By using Office 2010 Professional, you can save your money and time. We want people to be able to easily build solutions on top of the formats, but at the same time, we don’t want the average end user to feel much noticeable difference with the change (at least no negative differences).Office 2010 –save your time and save your money.

That leads me to why we had to restructure SpreadsheetML from the original design. The two issues with the SpreadsheetML format from 6 years ago was that it wasn’t full fidelity, and it wasn’t optimized for performance/file size. The term “Full fidelity” just means that everything that is in your file can be saved into the format without fear of it being modified or lost. Office 2010 key is for you now!The old SpreadsheetML format didn’t support a number of feature like images, charts, objects, etc. So we had to add all those additional things to the format.

The second part (performance) was a really important and challenging one. We wanted to move to an open format so that people could build solutions around our formats. Like many other applications out there, we chose a combination of ZIP and XML to achieve this. Office 2010 download is available now!We had to write the XML though in such a way that it could be parsed extremely efficiently so that the file open and save experience wouldn’t get significantly slower. There have been a number of articles related to this issue, where people have complained about performance in other applications that use XML as their format.Many people like buy Office 2010 Home. Of course we had to keep this in mind with our design, and for those of you who have played around with it I’m sure you’ve noticed the difference.


Aug 01 2010

Microsoft office 2007

Category: Microsoft officeadmin @ 7:06 pm

While I’m sure we’ll be able to spend the next several months talking about all this, some of the big things I wanted to point out are:

  1. Public feedback – While the Ecma organization is completely open and anyone can join, I understand that some people just aren’t able to make that commitment. Many people like buy Office 2010 Home.That’s why I was really excited that we have a mechanism set up now so that anyone can give feedback on the spec: ecmatc45feedback@ecma-international.org
  1. Technical discussion - If you are looking for technical discussions around the formats, you can also go to the openxmldeveloper.org site where there is a forum for a wide range of technical issues for developers who want to implement the formats.Office Professional 2010 is great!Navigating the PDF - The PDF file was actually generated using Word 2007. Bring up the Bookmark pane and you can easily navigate through the document structure (it’s over 4000 pages, so that helps a lot!). You will also notice that in the reference sections, you can easily navigate through element and type reference just by clicking on the section number next to the element or type’s name.The invention of Microsoft Office 2010 is a big change of the world.Spreadsheet Formulas - Check out 15.5 (starts on page 247). There are about 160 pages of content describing the formula syntax and about 360 different functions. You’ll notice that there is still a ways to go, but this is already a huge amount of really useful information.By using Office 2010 Professional, you can save your money and time.Depth of documentation - I know we’ve said this a million times, but this is a huge project. Migrating all the existing Office documents into an Open XML format and then providing full documentation is a ton of work. Many people don’t realize how large these applications are, and how much there really is to cover. If you want an example, download the spec and look a the documentation for the simple type “ST_Border” which starts on page 1617 (it’s in the WordprocessingML reference section under simple types). Office 2010 –save your time and save your money.That shows a list of almost 200 legacy border patterns that you can apply to objects in a Word document. Tristan Davis, the Word representative on the Technical Committee, had to wok on every single one of those and provide images so anyone else could reproduce them. He created almost 200 documents, took screenshots of each one, and then provided the description and image representation in the spec. Office 2010 key is for you now!This format is 100% compatible with the existing base of Microsoft Office documents, so nobody will need to worry about losing features, even if it’s the “Maple Muffins” border style (page 1643) :-) Want some more depth? – Check out section 14.5 starting on page 135.Office 2010 download is available now!

I’m so excited right now, I’m really rushing just to get this blog post out. I can’t wait to hear from people about what kinds of questions they have, or what they hope to do with the formats. We’ve going to have a lot of fun over the coming months (especially once Beta 2 is out the door and everyone can start to experiment with the files). More information to come, but that’s it for now.Many people like buy Office 2010 Home.


Aug 01 2010

Microsoft Office Open XML formats standard

Category: Microsoft officeadmin @ 7:03 pm

Wow, we finally have an updated draft of the Ecma Office Open XML formats standard! http://www.ecma-international.org/news/TC45_current_work/TC45-2006-50.htm I’ve been waiting for a long time to be able to share all the great work that’s been going on in Ecma TC45, and it’s so awesome that we have a new public draft. By using Office 2010 Professional, you can save your money and time.I can’t wait to hear what everyone thinks. If you go to that site, you’ll see three different downloads:

  1. Draft 1.3 of the spec – The big download is the spec itself in PDF form. It’s about 25 megabytes and is around 4000 pages.
  2. Draft 1.3 of the spec in the Open XML format – Alternatively, you can download the .docx version of the spec. Once Beta 2 comes out, you can open it that way (although opening 4000 pages of content with beta software may be slightly problematic <g/>)
  3. Schemas - The schema files are also available for download. They are available in a ZIP file, that also contains an index.htm file that describes each xsd.Office 2010 –save your time and save your money.

We’ve been working really hard over the past 5 months bringing this standard along. There is still a lot of work to do, but you’ll see pretty clearly that we’ve made a ton of progress over the initial submission from last year. We have weekly 2 hour phone conferences (they are actually at 6am my time which is not ideal <g/> ), as well as 3 day face to face meetings about every 2 months.

Office 2010 key is for you now!

The contributions from everyone has just been outstanding. It’s so awesome to work with such a diverse group of people. While the initial submission was made by Microsoft, it’s now completely in Ecma’s control and we’ve had a lot of help from Apple, Barclays Capital, BP, The British Library, Essilor, Intel, Microsoft, NextPage, Novell, Statoil, and Toshiba.The invention of Microsoft Office 2010 is a big change of the world.***Note*** Remember that this is just a draft. Some sections of the spec are much further along than others, so keep that in mind while you are looking through the spec. If you are in an area that looks like there isn’t much information, odds are we just haven’t gotten to that yet. Office 2010 download is available now!


« Previous Page