Get the Confluence Storage Format

This article covers how to get the starge format of a page.

The Confluence Storage Format is the technical under-the-hood view of a Confluence page. It’s what WikiTraccs “sees” and transforms when performing the Confluence to SharePoint migration.

How to view the storage format for a Confluence page? Two options.

Option 1: Use the browser to view the storage format

Here’s the Atlassian documentation on how to view the storage format: How to retrieve Confluence Storage Format

It boils down to this:

Open a page, choose the three-dot-menu, and choose "View Storage Format".

A new window will open showing the storage format.

If this option is not available, continue with the next section.

Option 2: Configure WikiTraccs.GUI to save the storage format to disk

WikiTraccs knows the storage format of each page it migrates from Confluence to SharePoint.

The storage format by default is only kept in memory and not persisted to a file.

Here’s how to tell WikiTraccs to store the storage format of every page to a file:

  1. Open the WikiTraccs.GUI folder (this is the folder where WikiTraccs.GUI.exe is stored as well)
  2. Create an empty file appsettings.json inside the WikiTraccs.GUI folder
  3. Open the appsettings.json file in a text editor and put the following text in there:
    {
        "CustomSettings": {
            "Debug": {
                "SaveTransformationInputToDisk": true
            }
        }
    }
    
  4. Save appsettings.json
  5. If WikiTraccs.GUI is open: close it and any other (console) windows it opened
  6. Open WikiTraccs.GUI and start a migration

WikiTraccs now stores the storage format for every newly migrated page in a file, in the attachment registry.

The attachment registry is a folder where attachments from Confluence are downloaded to, while migrating. You can find it in the AppData folder of your user account.

The attachment registry path for a Confluence page looks like C:\Users\<username>\AppData\Local\WikiTraccs\<confluenceuserkey>\<confluencebaseurl>\Attachments\<pageid>.

Here’s an example screenshot:

Attachment registry path for Confluence page 118587415.

The storage format for a page is stored in a file that is named like xhtml_before-<spacekey>-<pageid>.xml.

Now you can access the storage format for every page that is part of your Confluence to SharePoint migration.

Last modified June 2, 2023