Wiki Transformation Project Blog

News About WikiTraccs

Migrating Communardo and Comala Metadata
WikiTraccs Creates Images for Draw.io Pages
Exporting Historic Page Versions - It's Complicated
WikiTraccs Creates Draw.io Preview Images
Using the PnP Page Navigator Web Part as Table of Contents Replacement
How Are Excerpt Include Macros Handled?
Support Case: Missing draw.io Images
New WikiPakk Children Display Web Part and Usage Metrics
How to Export a Link Map
Sharing Content Across SharePoint Pages Is Impossible
How Nested Confluence Macros Are a Migration Challenge
M365 Authentication, API Requests, and Blocked Endpoints
Migrating Confluence Cloud Whiteboards
Harnessing the New SharePoint Page Format
Splitting a Space
Testing WikiTraccs for Markdown
Finding the Confluence Version
WikiTraccs for Markdown
Making SharePoint Tables Look Pretty
How much time will a Confluence to SharePoint migration take?
Confluence Quirks
Good Practices for your Confluence to SharePoint Migration
Confluence Cloud Specialties
WikiTraccs switches to Confluence Cloud v2 REST API
Migrated SharePoint Page Names Explained
How to migrate only some pages?
Confluence vs. SharePoint - Part 1: Overall structure
Confluence Link Types Explained
Mapping user accounts from Confluence to SharePoint
Confluence Authentication Overview
Confluence might misreport space contents
Can feature 'XYZ' be added to WikiTraccs?
Converting Gliffy and draw.io to SVG
Migrating Gliffy and draw.io macros to SharePoint Online
Transforming even more colors from Confluence to SharePoint!
How to run parallel WikiTraccs migrations?
Fixing image positioning in SharePoint
Broken inline image positioning in SharePoint
How to migrate rich Confluence tables to limited SharePoint tables?
Migrating large Confluence spaces to SharePoint
Ignoring macros when migrating pages from Confluence to SharePoint
Registering WikiTraccs as app in Azure AD
Registering WikiTraccs as app in Entra ID
The art of positioning images - part 2 of 2
The art of positioning images - part 1 of 2
Confluence Page Tree in SharePoint
What about those Confluence Macros?
What to expect from WikiTraccs?
New attachments macro transformation
Mapping principals and migrating permissions
WikiTraccs Quick Start Video out now!
New release - UI overhaul
Measuring page migration success
A new approach to configuring WikiTraccs
Announcing WikiTraccs

New Releases

WikiTraccs 1.27.16
WikiTraccs 1.26.18
WikiTraccs 1.25.11
WikiTraccs 1.24.20
WikiTraccs 1.23.16
WikiTraccs 1.22.11
WikiTraccs 1.21.11
WikiTraccs 1.20.43
WikiTraccs 1.20.31
WikiTraccs 1.20.15 (Prerelease)
WikiTraccs 1.20.6 (Prerelease)
WikiTraccs 1.19.13
WikiTraccs 1.19.5
WikiTraccs 1.18.19
WikiTraccs 1.18.15
WikiTraccs 1.18.13
WikiTraccs 1.18.11
WikiTraccs 1.18.10 (Beta)
WikiTraccs 1.17.5 (Prerelease)
WikiTraccs 1.17.7 (Preview)
WikiTraccs 1.17.4
WikiTraccs 1.16.2
WikiTraccs 1.14.13
WikiTraccs 1.14.10
WikiTraccs 1.14.6
WikiTraccs 1.14.8
WikiTraccs 1.14.2
WikiTraccs 1.13.1
WikiTraccs 1.13.0
WikiTraccs 1.12.29
WikiTraccs 1.12.16
WikiTraccs 1.12.6 (Prerelease)
WikiTraccs 1.12.5
WikiTraccs v1.11.16
WikiTraccs v1.11.12 (Prerelease)
WikiTraccs v1.11.9 (Prerelease)
WikiTraccs v1.11.6 (Prerelease)
WikiTraccs v1.11.1
WikiTraccs v1.10.24 (Preview)
WikiTraccs v1.10.12
WikiTraccs v1.10.9
WikiTraccs v1.10.8 (Preview)
WikiTraccs v1.10.4 (Preview)
WikiTraccs v1.9.0
WikiTraccs v1.8.6
WikiTraccs v1.8.4 (Preview Release)
WikiTraccs v1.7.6
WikiTraccs v1.7.4
WikiTraccs v1.7.1 (Update Mode Preview)
WikiTraccs v1.6.35
WikiTraccs v1.6.32
WikiTraccs v1.6.31
WikiTraccs v1.6.29
WikiTraccs v1.6.11
WikiTraccs v1.6.8
WikiTraccs v1.6.6
WikiTraccs v1.6.4
WikiTraccs v1.5.0
WikiTraccs 1.4.6
WikiTraccs 1.4.2
WikiTraccs 1.3.14
WikiTraccs 1.3.13
WikiTraccs 1.3.8
WikiTraccs 1.3.7
WikiTraccs 1.2.2
WikiTraccs 1.1.1
GitHub Releases

This is the blog section. It has two categories: News and Releases.


News About WikiTraccs

Migrating Communardo and Comala Metadata

This blog post shows the current progress of metadata migration with regard to Comala Document Management and Communardo Metadata for Confluence.

By Heinrich Ulbricht | Friday, June 06, 2025

WikiTraccs now migrates metadata from Comala Document Management (formerly Comala Workflows) and Communardo Metadata for Confluence.

Where to Find Migrated Metadata in SharePoint?

WikiTraccs migrates metadata as page labels. You can see them in SharePoint in the Site Pages library. Note that you might have to add the Confluence: Labels (WikiTraccs) column to the current Site Pages list view, to see the labels.

For Communardo Metadata for Confluence you’ll see labels like these, representing metadata that was attached to the Confluence page:

migration:[communardometadata.global.metadatafield.orgunit=Atlassian Solutions]
migration:[communardometadata.global.metadatafield.state=Finalized]
migration:[communardometadata.global.metadatafield.continue=No]
migration:[communardometadata.global.metadatafield.fiscalyear=2025/2026]
migration:[communardometadata.global.metadatafield.quarter=]

For Comala Document Management you’ll see these entries:

migration:[comala.workflowName=Review Workflow]
migration:[comala.state.name=Active]
migration:[comala.state.dueDate=1748516063000]

Those are the workflow name, the name of the current state, and the due date (in Comala-native format).

Labels that were added by WikiTraccs have the migration: prefix to separate them from regular page labels, which mostly have the global: prefix.

Technical Notes

Metadata migration is always enabled when starting a migration run.

When starting a migration run, WikiTraccs will check if programming endpoints from both Communardo and Comala plugins are available. If they are available, metadata will be retrieved for every page. If those endpoints are not available, WikiTraccs will disable metadata migration efforts for those plugins after some retries.

How to Use the Metadata in SharePoint?

Since metadata from the Comala and Communardo plugins ends up in plain text format in a text column in SharePoint, you’ll probably need post processing to do something meaningful with that data.

I recommend using PnP PowerShell to read the labels, transform them to a format that’s useful to you, and store them at a place that makes sense in your context.
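The parsing step of such post-processing could look like the following minimal Python sketch. It assumes the labels have already been read from the column (for example, with PnP PowerShell) into a list of strings, one label per entry; how labels are separated inside the column is not covered here. Treating the Comala due date as Unix epoch milliseconds is also an assumption, based on the sample value shown earlier. The function names are made up for illustration.

```python
import re
from datetime import datetime, timezone

# Matches labels of the form "migration:[key=value]"; the value may be empty.
LABEL_PATTERN = re.compile(r"^migration:\[([^=\]]+)=(.*)\]$")


def parse_migration_labels(labels):
    """Return a dict of key -> value for all labels that carry the migration: prefix."""
    metadata = {}
    for label in labels:
        match = LABEL_PATTERN.match(label.strip())
        if match:
            metadata[match.group(1)] = match.group(2)
    return metadata


def due_date_to_utc(raw_value):
    """Interpret a Comala due date as Unix epoch milliseconds (an assumption)."""
    return datetime.fromtimestamp(int(raw_value) / 1000, tz=timezone.utc)


labels = [
    "migration:[communardometadata.global.metadatafield.state=Finalized]",
    "migration:[communardometadata.global.metadatafield.quarter=]",
    "migration:[comala.workflowName=Review Workflow]",
    "migration:[comala.state.dueDate=1748516063000]",
    "global:some-regular-label",  # regular page labels are ignored
]

metadata = parse_migration_labels(labels)
print(metadata["comala.workflowName"])
print(due_date_to_utc(metadata["comala.state.dueDate"]).isoformat())
```

Regular labels without the migration: prefix are skipped, and fields with an empty value (like quarter above) are kept as empty strings so you can tell "empty" apart from "not present".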

WikiTraccs Creates Images for Draw.io Pages

This post shows WikiTraccs exporting each page of a draw.io diagram as an image.

By Heinrich Ulbricht | Friday, May 23, 2025

Note

The functionality described in this blog post is available as of WikiTraccs v1.27.1.

This post builds on the last one that highlighted how WikiTraccs can create missing draw.io preview images: WikiTraccs Creates Draw.io Preview Images.

But why stop at missing draw.io preview images? Let’s apply the export logic to draw.io diagram pages as well.

How Do Diagram Pages Work in Draw.io?

Draw.io diagrams can contain multiple diagram pages which are represented as tabs at the bottom of the draw.io editor.

Here is a simple sample diagram, showing the first of three pages:

Clicking a tab will switch to the corresponding diagram page.

A draw.io macro by default shows a single diagram page, although you can cycle through the pages:

In the draw.io macro settings you can select the diagram page to show when opening the Confluence page:

Until now, when a Confluence page containing a draw.io diagram was migrated to SharePoint and not all diagram pages were shown via draw.io macros, most diagram pages tended to be invisible in SharePoint.

WikiTraccs changes that.

Creating Draw.io Page Images in SharePoint

WikiTraccs can create images for all diagram pages in a draw.io diagram:

It doesn’t matter if all those diagram pages are shown on a Confluence page.

The moment a single diagram page is shown on a Confluence page, WikiTraccs will export all diagram pages as images and migrate those to SharePoint Online.

You’ll see a collapsed section labeled Page Snapshots of Draw.io ‘DIAGRAMNAME’:

Expanding this section shows all diagram pages:

As shown in the screenshot above, WikiTraccs will:

create a section that is added after the section that contains the first draw.io diagram page
create an image for each of the diagram pages
format those images using a layout table, as SharePoint otherwise is not able to put images next to each other

Each exported diagram page image is stored as an additional attachment of the SharePoint page. Those images are marked with the -wt suffix.

Note

WikiTraccs will export images from the latest version of the draw.io diagram file. Draw.io macros in Confluence might’ve referenced an older version, so there could be visual differences between those macros and the exported images shown in SharePoint. A note in the section title will hint at the fact that the images are from the latest diagram version (‘latest version’).

I’m not yet satisfied with how the section looks in SharePoint. Is it at the right place? Should the section be lower in the page? How about a section background color? Should the formatting table have borders? I appreciate feedback on this matter.

Known Limitations

When there are multiple multi-page diagrams on a page, the sections in SharePoint will be generated in reverse order.

Exporting Historic Page Versions - It's Complicated

This post highlights challenges when it comes to exporting (or migrating) historic Confluence page versions.

By Heinrich Ulbricht | Thursday, May 08, 2025

The origin of this post is the requirement to migrate historic page versions to SharePoint Online. WikiTraccs currently doesn’t do that.

Could WikiTraccs migrate historic page contents? It’s complicated and also a matter of expectations.

In this post, we first look at Confluence’s own limits in properly representing page history. We then look at how macros behave. We’ll also explore options to export historic page versions from Confluence. And finally we look at what that means for WikiTraccs in the context of Confluence to SharePoint migrations.

Let’s start with breaking page history by messing with attachments.

Version Note

The tests in this article were conducted in Confluence 8.7.1.

Changing Attachments Changes Page History

The following steps show how to change page history by overwriting an attachment file that is used by the historic page. The historic page will show the updated file (but shouldn’t).

This page shows two old Atlassian logos:

The attachments for this page show the two image files as atlassian-logo.png and atlassian-logo2.png:

Now let’s assume Atlassian rebrands and the page is updated with the new logo (creating a new page version):

The image file for the new logo is uploaded as atlassian-logo.png, overwriting the existing file that has the same name. The other attachment, atlassian-logo2.png is not needed anymore and thus deleted:

Looking at the page history, we now have two page versions:

And now things get weird.

Comparing the two page versions makes it evident that changing the attachments also changed the page’s history:

Let’s export the page as PDF using the built-in Export to PDF function that Confluence offers.

Here’s how the export looks for the current version of the page; this looks correct:

Now switch to the older page version (you can guess where this is going…). This is Confluence showing version 1 of the page:

Here’s how the export looks for the old page version; this looks unexpected, as both the historic images and the historic text are absent:

Exporting old page versions to PDF is not supported. Confluence always exports the current version.

Exporting Old Page Versions to PDF is Not Supported by Confluence

Just to make that really clear: using the out-of-the-box PDF export of Confluence, you can only export the current page version. Even when viewing a historic version, using the export function will export the current version. This also applies to the Word export, by the way.

There might be third-party solutions available that solve this problem. But we are looking at out-of-the-box features.

Unclear What to Expect from Macros

Let’s see how macros handle being shown on historic page versions. The results will differ from macro to macro and are not always correct.

Children Display Macro

Here’s the built-in Children Display macro added to our sample page, correctly showing one child page:

By editing the page to add the macro, we created a new page version. When viewing a historic version of the page, the expectation could be to see exactly the above state, showing the one child page (spoiler: that doesn’t work).

Now we edit the page and add some text, while at the same time adding a second child page. The result is additional text on the page and the macro showing two child pages:

Now let’s go back in history and view the previous page version:

To be honest, I did not expect this. I expected the Children Display macro to show up and to show the current state of the page tree (so, two child pages).

Apparently the Children Display macro cannot handle history, at least in Confluence version 8.7.1.

Team Calendar

How is the Team Calendars macro handled when going back in page history?

Here’s our modified sample page, showing a Team Calendar with Adele’s vacation:

Editing the page (creating a new page version), we change the macro configuration to show all vacations as a list. Also, Parzival’s vacation has been added, making it two entries in the list:

Now let’s go back in page history, viewing the previous page version.

The historic Team Calendar macro correctly applies the old configuration (showing a calendar, not a list), but the vacation data is current. It shows both vacations, even though at the time this page version was created, only Adele’s vacation existed:

The result is a mix of historic macro configuration and current data.

By the way, exporting the current page version to PDF is broken as the calendar content is missing entirely:

How about Migrating Historic Page Versions to SharePoint Online?

In my view, Confluence fails to properly show and export historic page versions. In many cases there are valid technical reasons for that, but that doesn’t help.

On top of that, historic page behavior depends on how macros are implemented, which differs from vendor to vendor.

WikiTraccs could not change any of the above if it tried to migrate historic Confluence page versions to historic SharePoint page versions.

Given the limitations and behavioral differences of SharePoint, the migration result for historic page versions would be even less appealing than it currently is using out-of-the-box tooling in Confluence.

There is one approach, however, that would allow migrating historic Confluence page versions to SharePoint while staying as close as possible to what Confluence offers - and that is printing the page.

Printing Historic Pages to PDF

The approach really is that simple: open a historic page version in the browser; print the page to PDF using the browser.

Here are different versions of our sample page printed to PDF:

The current page version.

Historic page version, showing the Team Calendar as calendar.

Historic page version, showing the broken Children Display macro.

The very first historic page version, showing the modified images.

Printing pages to PDF has the following benefits:

the historic content should be near identical to what Confluence shows in the browser when viewing a historic page version (although there seem to be styling deviations)
the PDF contains information like author, version, labels etc. - which can be a benefit
the printed PDF seems to capture more content than the built-in PDF export as the Team Calendars sample showed

There are also challenges with printing pages to PDF:

very wide pages might get cut off if the PDF page is not wide enough
Confluence Cloud adds floating menus and headers to the page that appear when printing the page, which might overlap with page content
contents in tab-like macros, collapsed Expand macros, etc. aren’t covered, as the printed PDF only shows what’s visible when opening the page initially
there might be challenges with multilingual pages, where the current page content looks different, depending on the user’s chosen language

How about Third-Party Solutions?

So far, we looked at the Confluence standard.

There might be third-party solutions out there that allow exporting historic pages properly. If such a third-party solution provided a REST API, WikiTraccs could leverage that to export pages.

Let me know if you know any good third-party solutions that could help.

Wrap Up

Confluence fails to meet expectations with regard to showing and exporting historic page versions.

Also, behavior of “historic macros” will differ from vendor to vendor.

Historic page versions might show broken macros, show a mix of historic macros and current data, and cannot be exported to PDF (using the out-of-the-box method).

WikiTraccs currently only migrates the latest page version to SharePoint Online and - given all the limitations Confluence has when it comes to historic page versions - will not try to be smarter than Confluence.

Printing historic pages to PDF might be an option to migrate historic page snapshots to SharePoint Online, while staying as close to the Confluence standard as possible.

WikiTraccs Creates Draw.io Preview Images

How to get draw.io preview images back and introducing the draw.io viewer in WikiTraccs.

By Heinrich Ulbricht | Wednesday, April 16, 2025

Note

The functionality described in this blog post is available as of WikiTraccs v1.26.11.

In Migrating Gliffy and draw.io Macros to SharePoint Online we briefly looked into how draw.io macros are migrated to SharePoint Online.

In Support Case: Missing draw.io Images we looked at how draw.io preview images are used by Confluence and WikiTraccs, and how they can be missing.

In this post we work towards a solution to create missing preview images and thus get our images back on migrated SharePoint pages.

Video Introduction to Draw.io Migration and Preview Image Generation

This 8-minute video briefly goes into how draw.io macros are migrated, why draw.io preview images are important for the migration, and how you can use WikiTraccs to bring them back if they are missing:

How WikiTraccs Can Create Missing Draw.io Preview Images

Since the only way to export a draw.io diagram is to show it in a browser first, WikiTraccs will do exactly that.

When encountering a draw.io diagram in Confluence where the preview image is missing, WikiTraccs tries to open the draw.io diagram in the browser to export it as an image.

Here’s how that works:

a page with draw.io diagram diagram1 is being migrated by WikiTraccs
WikiTraccs looks for the diagram1.png preview image; if it is present: fine, use that; if it is missing, continue
WikiTraccs opens a browser to show the draw.io viewer, as documented here: https://www.drawio.com/doc/faq/embed-mode
WikiTraccs loads diagram1 into the viewer (the diagram should be visible now)
WikiTraccs exports the diagram to a PNG image file and uses that from now on
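
The decision logic of the steps above can be sketched in a few lines of Python. This is an illustration, not WikiTraccs’ actual implementation: the function name, the attachments dictionary, and the render_in_viewer callback (a stand-in for the browser-based export) are hypothetical.

```python
from typing import Callable, Dict, Optional


def resolve_preview_image(
    diagram_name: str,
    attachments: Dict[str, bytes],
    render_in_viewer: Callable[[bytes], Optional[bytes]],
) -> Optional[bytes]:
    """Return PNG bytes for a diagram, preferring an existing preview image."""
    preview_name = f"{diagram_name}.png"
    if preview_name in attachments:
        # The preview image is present: use it, no browser needed.
        return attachments[preview_name]

    # The raw diagram file is stored as an attachment without file extension.
    diagram_file = attachments.get(diagram_name)
    if diagram_file is None:
        # Not a direct page attachment (for example embedded from an
        # external location), so there is nothing to render.
        return None

    # Fallback: load the diagram into the viewer and export it as PNG.
    return render_in_viewer(diagram_file)
```

The browser is only involved in the last step, which is why an existing preview image is always the cheaper path.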

Now there should be a diagram image on the migrated SharePoint page as well.

Note: This works for draw.io diagrams that are stored as a direct Confluence page attachment. It won’t work for diagrams that are embedded in the Confluence page from an external location like OneDrive.

Prerequisites for Draw.io Preview Image Generation

Note

If you use Interactive as authentication mode for Confluence, you should be good to go; interactive mode also opens a Chrome browser controlled by WikiTraccs.

The following prerequisites need to be met:

WikiTraccs must be able to start and control Chrome to show the draw.io diagram viewer in Chrome, and to be able to export the diagram; specifically:
- Google Chrome must be installed on the machine WikiTraccs is running on
- Google’s endpoints must be available for WikiTraccs to download the Chrome WebDriver required to remote-control Chrome
No cookies.txt authentication workaround is being applied (note: this is a rarely used configuration)
Endpoints required by the draw.io diagram viewer need to be accessible from Chrome, where the diagram view will be shown by WikiTraccs
- https://viewer.diagrams.net - to load the viewer and image resources
- potentially more, depending on diagram content

Firewall

If you are in a locked down environment where outgoing connections need to be whitelisted, you will have to adjust the configuration to allow loading the diagram viewer resources from viewer.diagrams.net.

Draw.io Diagram Viewer Showcase

Below you see what the diagram viewer opened by WikiTraccs will look like.

If you see the diagram, the viewer resources seem to be accessible at least from this browser:

To verify the result, here’s a screenshot of how the diagram above should look:

Screenshot of the draw.io diagram that should be shown in the diagram viewer above.

How to Enable Draw.io Preview Image Generation

In the WikiTraccs Settings dialog, check Create missing draw.io preview images:

Known Limitations

Currently, there is no indication about external resources in diagram files. (Note: This will change in a future release.) If icons fail to be downloaded by the draw.io diagram viewer, those icons might be missing in the preview image as well. See also the sections External Resources and Blocking External Resources below.

Preview image generation works for draw.io diagram files that are page attachments in Confluence. There is an option to embed draw.io diagrams from external services like OneDrive or Google Drive - those cannot be handled.

In the wild, WikiTraccs encountered a broken (?) draw.io macro that failed to properly reference the shown diagram layer, which led to an empty preview image. So far, this was encountered once, but if there is one case, there might be others. Please report if preview images (generated by WikiTraccs) are empty.

WikiTraccs will export images from the latest version of the draw.io diagram file. Draw.io macros in Confluence might reference an older revision, so there could be visual differences between those macros and the exported images shown in SharePoint. I’m not sure if this is even an issue as the draw.io macro itself seems to prefer the latest diagram version (at least when linking across sites). A note above the image will hint at the fact that the image shows the latest version (example: ‘note: latest version instead of v1’).

Privacy Notes

Diagram data stays local.

To cite drawio.com:

Our […] editor is loaded as a static application […], but the diagram data is passed entirely client-side between windows, it’s never sent back to, or sourced from, the draw.io application server. This means you control and store your data […].

So, while the editor is loaded from viewer.diagrams.net into the browser (to have a canvas to draw the diagram to), diagram data is processed locally.

External Resources

If a diagram contains links to external resources (for example external images not directly embedded in the diagram), the diagram viewer will download those resources. The external host will receive and can observe that request.

Blocking External Resources

When you block direct access to external resource hosts, the diagram viewer doesn’t give up; instead, it downloads resources via a proxy endpoint. This proxy is hosted on viewer.diagrams.net.

A call to the proxy endpoint looks like this: https://viewer.diagrams.net/proxy?url=https://upload.wikimedia.org/512px-Teams.png

So, when you block access to (for example) upload.wikimedia.org in your environment, and if viewer.diagrams.net can be connected to, the proxy will get the image for you.

In this process, the proxy learns about the URL of the external image.

If you want to prevent the proxy from learning about external resource URLs, block the proxy endpoint as well. External resources, e.g. icons or images, will then be missing in the exported diagram.
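
To get a feeling for what the proxy learns, the following Python sketch extracts the external resource URL from a proxy call, for example when reviewing outbound requests in a firewall or proxy log. It assumes that proxy calls follow the format of the sample URL shown earlier; whether the viewer percent-encodes the url parameter is not known here, so the sketch decodes it defensively. The function names are hypothetical.

```python
from urllib.parse import unquote, urlsplit

# Prefix taken from the sample proxy call; treat it as an assumption.
PROXY_PREFIX = "https://viewer.diagrams.net/proxy?url="


def external_url_from_proxy_call(request_url):
    """Return the external resource URL carried by a proxy call, or None."""
    if not request_url.startswith(PROXY_PREFIX):
        return None
    # Decoding is a no-op if the URL was passed unencoded.
    return unquote(request_url[len(PROXY_PREFIX):])


def external_host(request_url):
    """Return the host of the external resource that the proxy would fetch."""
    target = external_url_from_proxy_call(request_url)
    return urlsplit(target).hostname if target else None


sample = "https://viewer.diagrams.net/proxy?url=https://upload.wikimedia.org/512px-Teams.png"
print(external_url_from_proxy_call(sample))
print(external_host(sample))
```

Requests to viewer.diagrams.net that are not proxy calls yield None, so the helper can be run over a mixed list of logged URLs.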

Note

The information above only applies to resources which are not part of the diagram file (thus being external). In most cases, icons, images, lines, text, etc. are part of the draw.io diagram file, which makes them internal resources that don’t need to be loaded from external hosts.

Using the PnP Page Navigator Web Part as Table of Contents Replacement

How to use the PnP Page Navigator Web Part as Table of Contents Replacement.

By Heinrich Ulbricht | Sunday, April 13, 2025

With release v1.26.1 WikiTraccs introduced macro transformation templates that allow converting macros to SharePoint web parts.

Let’s try converting the Table of Contents macro to the PnP Page Navigator web part, which provides similar functionality.

What is PnP Page Navigator?

The PnP Page Navigator web part shows the headings of the SharePoint page it’s placed on. Clicking a heading entry jumps to that heading.

The Page Navigator is an SPFx solution that is developed by the PnP community, a group of people that creates open source solutions for the Microsoft 365 ecosystem.

The web part is available in the sp-dev-fx-webparts repository on GitHub: Page Navigator.

It seems to be distributed as source code, so normally you’d have to set up a development environment and build it for yourself. I did this for the current version 1.10 (released on March 15, 2025).

You can download the SPFx solution package for version 1.10 of the Page Navigator here: react-page-navigator.sppkg.

Note that I take no responsibility for what’s in the solution. I merely followed the instructions in the solution’s README and built it from source code.

Creating a Macro Transformation Template

To tell WikiTraccs to transform the Table of Contents macro to the Page Navigator web part we need to create a macro transformation template.

It looks like this:

<script id="data-sp-webpartdata" type="application/json">
{
    "id": "bcac4d9d-adf5-4462-97c5-e5f3e97dd518",
    "instanceId": "{{{RandomGuid1}}}",
    "title": "Page Navigator",
    "description": "Page Navigator builds a navigation structure on the page, based on the headers in your text.",
    "audiences": [],
    "serverProcessedContent": {
        "htmlStrings": {},
        "searchablePlainTexts": {},
        "imageSources": {},
        "links": {}
    },
    "dataVersion": "1.0",
    "properties": {
        "stickyMode": false,
        "stickyParentDistance": "1"
    },
    "containsDynamicDataSource": false
}
</script>

<div data-wikitraccs-webpart-template="true" data-sp-controldata="data-sp-controldata" data-sp-webpartdata="data-sp-webpartdata">🚧 Moved Page Navigator web part as it cannot be embedded here</div>

Note

Store this template as toc.hbs in WikiTraccs’ template folder to activate it.

Note: At the time of writing, the above template is not yet available in the WikiTraccs templates folder, but it will be in one of the next WikiTraccs releases.
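
When adapting a template like this for another web part, it can help to check that the JSON part is still well-formed once placeholders are filled. The following Python sketch does that outside of WikiTraccs. It is not how WikiTraccs renders templates; it assumes that placeholders of the form {{{RandomGuidN}}} stand for random GUIDs and that the same placeholder name yields the same GUID within one template. The function name is hypothetical.

```python
import json
import re
import uuid

# Triple-brace placeholders like {{{RandomGuid1}}}, as used in the template.
PLACEHOLDER = re.compile(r"\{\{\{(RandomGuid\d+)\}\}\}")


def render_webpart_json(template_json):
    """Fill RandomGuid placeholders with GUIDs and parse the result as JSON."""
    guids = {}

    def substitute(match):
        # Reuse the GUID if the same placeholder appears more than once
        # (an assumption about the intended placeholder semantics).
        return guids.setdefault(match.group(1), str(uuid.uuid4()))

    rendered = PLACEHOLDER.sub(substitute, template_json)
    # json.loads raises json.JSONDecodeError if the template is malformed.
    return json.loads(rendered)


fragment = """
{
    "id": "bcac4d9d-adf5-4462-97c5-e5f3e97dd518",
    "instanceId": "{{{RandomGuid1}}}",
    "properties": {"stickyMode": false, "stickyParentDistance": "1"}
}
"""
web_part = render_webpart_json(fragment)
print(web_part["instanceId"])
```

Only the content of the script element is checked here; the surrounding div with its data attributes is left out.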

Testing the Transformation from Macro to Web Part

Note

For the Page Navigator web part to be available in a site, the solution package must have been installed to an app catalog and the app must have been added to the site you want to use the web part in.

Here’s our Confluence test page:

Note the Table of Contents macro at the start of the page.

Here’s the modern SharePoint Online page that WikiTraccs creates:

Note that the Table of Contents macro has successfully been transformed to the Page Navigator web part.

Closing Notes

While testing I stumbled over some effects that look like bugs in the Page Navigator web part:

I find it also important to note that the Page Navigator web part loads the whole page content again, to scan for headings. This might be a strain on mobile connections and something to consider when targeting environments where the amount of transferred data matters.

How Are Excerpt Include Macros Handled?

How does WikiTraccs handle macros like Excerpt and Excerpt Include, which can be used to reuse content across pages? This post has answers.

By Heinrich Ulbricht | Wednesday, April 09, 2025

In Confluence it is common to reuse parts of a page in other pages.

One way to do that is using the Excerpt macro to define a reusable content snippet and then, on another page, the Excerpt Include macro can be used to reference and show that reusable snippet.

In SharePoint there is nothing that compares to this function, as I outlined in Sharing Content Across SharePoint Pages Is Impossible.

So, when encountering those macros, how does WikiTraccs handle them?

Excerpt, Multiexcerpt

Let’s look at the source of the reusable snippets.

WikiTraccs currently handles Excerpt and Multiexcerpt explicitly.

The body content of those macros will be migrated as if it was normal page content.

The enclosing macro does matter in one regard - it has a hidden parameter. This parameter specifies if the macro is visible on the page or not.

How does WikiTraccs handle the value of the hidden parameter? That depends on the macro transformation templates for Excerpt and Multiexcerpt.

At the time of writing this, hidden Excerpt macros are skipped (the migrated page does not contain its content), but the content of hidden Multiexcerpt macros is migrated and marked as originating from a macro. This has historical reasons. The behavior can be changed by changing the macro transformation templates.

Excerpt Include, Multiexcerpt Include

Let’s look at the macros that can show the reusable content snippets defined by Excerpt and Multiexcerpt: Excerpt Include and Multiexcerpt Include. Those are the two that WikiTraccs handles (at the time of writing this).

When encountering one of those Excerpt/Multiexcerpt Include macros, WikiTraccs loads the referenced page that contains the corresponding Excerpt/Multiexcerpt macro, copies its body content, and inserts it into the referencing page at the place of the Include macro.

WikiTraccs takes care of adjusting links and image references in the copied content so that they continue to work in the new context.

What WikiTraccs doesn’t do is copy images (or other attachments). If the reusable snippet contains an image, the content that WikiTraccs copies to other pages will reference that same image. So, the image is not copied to other pages.

Transformation Templates

The handling of the Excerpt Include and Multiexcerpt Include macros is somewhat special in that they use the new (note: as of WikiTraccs v1.26.0) macro transformation stages builtin and postbuiltin.

When handling the Excerpt Include and Multiexcerpt Include macros in the builtin stage of macro transformation, WikiTraccs does not yet remove those macros from the source page tree, but enriches them with the copied macro body of the corresponding Excerpt and Multiexcerpt macro on the referenced page.

Now that the macros have a body, the transformation template for the next stage, the postbuiltin stage, takes care of replacing the macro with its body content and marking it.

The excerpt-include[postbuiltin].hbs transformation template is responsible for marking the copied content with two dividers and the link to the source page.
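
The two stages can be illustrated with a small Python sketch. It only models the idea described above (enrich first, replace afterwards); the data classes, function names, and the exact layout of dividers and source link are hypothetical and not taken from WikiTraccs or its templates.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Union


@dataclass
class Macro:
    name: str                          # e.g. "excerpt" or "excerpt-include"
    body: Optional[str] = None         # macro body content, if any
    target_page: Optional[str] = None  # page referenced by an include macro


@dataclass
class Page:
    title: str
    nodes: List[Union[str, Macro]] = field(default_factory=list)


def builtin_stage(page: Page, pages: Dict[str, Page]) -> None:
    """Enrich include macros with the referenced excerpt body; keep them in the tree."""
    for node in page.nodes:
        if isinstance(node, Macro) and node.name == "excerpt-include":
            source = pages.get(node.target_page)
            if source is None:
                continue  # referenced page not found: leave the macro without body
            for candidate in source.nodes:
                if isinstance(candidate, Macro) and candidate.name == "excerpt":
                    node.body = candidate.body
                    break


def postbuiltin_stage(page: Page) -> None:
    """Replace enriched include macros with their body, marked by dividers and a source link."""
    result: List[Union[str, Macro]] = []
    for node in page.nodes:
        if isinstance(node, Macro) and node.name == "excerpt-include" and node.body is not None:
            result += ["---", node.body, f"Source: {node.target_page}", "---"]
        else:
            result.append(node)
    page.nodes = result
```

Splitting the work this way keeps the page lookup in one stage, while the other stage, which decides how the copied content is presented, stays a pure template concern.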

Support Case: Missing draw.io Images

How draw.io images can be missing, the consequences, and how to work around that.

By Heinrich Ulbricht | Tuesday, April 08, 2025

Update

WikiTraccs can now bring missing preview images back! See WikiTraccs Creates Draw.io Preview Images.

In Migrating Gliffy and draw.io Macros to SharePoint Online I briefly explained how draw.io macros are migrated to SharePoint.

The post briefly describes how there are two assets for each draw.io diagram in Confluence:

the draw.io raw diagram file (file without extension)
a preview image of the diagram (PNG file)

From time to time clients encounter cases in their environment where the preview image is missing. What are the consequences of a missing preview image?

How Is the Preview Image Used in Confluence?

Let’s look at this page, which has two diagrams, one with its preview image missing:

The attachments list looks like this; note that only one preview image is present (preview-image-available.png) because I manually deleted the other one:

Looking at the screenshots above, let’s start with the one place where the missing preview image is no issue: the actual diagram on the Confluence page. The diagram is shown properly, so you don’t immediately notice that the preview image is missing when looking at the page.

Missing in Page Edit Mode

When editing the page this looks different. Apparently the preview image is required here:

Sometimes a placeholder is shown, like above; sometimes there is a “broken image” icon.

Missing in Page Exports

The diagram will be missing in page exports.

Here’s the PDF export of our sample page that contains two draw.io diagrams; but only one shows up in the export:

Missing in Emails

There are use cases in which Confluence page content is sent via email. Those use cases also rely on the preview images being present.

If the preview image is missing, the image won’t show up in the email that gets sent.

Missing in REST API Integrations

There are REST endpoints that can be used to export macros in automated scenarios.

Normally, data returned by those endpoints contains the image (if the macro has one). The PlantUML macro is one such example where a proper image is being generated and returned. Or the Roadmap macro.

Those REST endpoints can be used by tools like WikiTraccs, or by other script-based solutions, to further process the page.

For draw.io, if the preview image is missing, it will be missing for all REST API consumers as well.

Missing on Migrated Page in SharePoint

WikiTraccs relies on the preview image being present. It’s what’s shown on the SharePoint page.

If the preview image is missing, no image can be shown.

Note that the raw draw.io diagram file will be migrated as a page attachment and could theoretically be downloaded and opened in the draw.io desktop application.

How to Check If I Am Affected?

Draw.io provides a built-in integrity checker that is available to Confluence administrators:

The integrity check will also check for missing preview images.

The integrity check result then looks like this:

The following diagram images were not found:
Total: 1
Macro: drawio, on page: http://wiki.contoso.com/pages/viewpage.action?pageId=22183937, Diagram image file not found: preview-image-missing.png (version: 1)
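If the report lists many pages, a small script can pull out the affected page URLs and file names. The following Python sketch assumes the report lines look exactly like the sample above; other draw.io versions may word them differently, so treat the pattern as a starting point. The function name parse_missing_previews is made up for this example.

```python
import re

# Pattern modeled on the single sample line shown above (an assumption, not a documented format).
LINE = re.compile(
    r"Macro: drawio, on page: (?P<url>\S+), "
    r"Diagram image file not found: (?P<file>.+?) \(version: (?P<version>\d+)\)"
)


def parse_missing_previews(report):
    """Return one dict per reported missing preview image."""
    found = []
    for line in report.splitlines():
        match = LINE.search(line)
        if match:
            found.append(match.groupdict())
    return found


report = """The following diagram images were not found:
Total: 1
Macro: drawio, on page: http://wiki.contoso.com/pages/viewpage.action?pageId=22183937, Diagram image file not found: preview-image-missing.png (version: 1)"""

missing = parse_missing_previews(report)
for entry in missing:
    print(entry["url"], entry["file"])
```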

Unfortunately, draw.io doesn’t provide a repair function.

Version Note: Tested on Confluence Data Center 8.7.1 with draw.io version 13.1.10.

How to Get The Preview Image Back?

Update

WikiTraccs can now bring missing preview images back! See WikiTraccs Creates Draw.io Preview Images.

There seems to be only one way to get the preview image back: edit and save the draw.io diagram in the browser.

Saving the diagram is the moment where the preview image is created and stored.

The preview image seems to be created in the browser, which would explain why there is no other mechanism to trigger the generation of this image. The draw.io diagram has to be open in a browser. Only then is it available to be saved to file by the draw.io editor.

Let me know if you know of another way to get the preview image back.

Note

While testing the draw.io integrity checker, I noticed that it failed to recognize re-created preview images. They still showed up as missing. Not sure if that is an issue of my test environment, or a general one.

Confluence Cloud and External Sources

In Confluence Cloud, using the Embed draw.io Diagram macro, it’s possible to include draw.io diagrams that are stored in OneDrive and other cloud locations.

When you visit a page with OneDrive integration for the first time, the following message will be shown:

"Authorize draw.io to access OneDrive"

When clicking the message, you’ll be asked to authorize draw.io:

"Authorize draw.io to access OneDrive"

After giving authorization, the diagram should be shown. Unless there is a permission error.

In case of permission error, the following message will be shown:

"Error: Access Denied. File not found or you do not have permission to access 'diagramname.drawio' on OneDrive."

When the access issue is resolved, the diagram will be shown.

Now the caveat.

In none of the above cases - even if the diagram is shown - will the preview image of the diagram be generated. Thus, there will be no image in SharePoint Online since it’s only in the diagram macro that the image is being rendered.

This is a technical limitation.

New WikiPakk Children Display Web Part and Usage Metrics

This blog post highlights new features in the latest WikiPakk update.

By Heinrich Ulbricht | Tuesday, March 25, 2025

The latest WikiPakk release v2.9.0 addresses two often mentioned use cases.

Children Display Web Part

You can now use the Children Display web part to show child pages of the current page.

Note

The Children Display web part is a new web part. So, there are now two WikiPakk web parts available - Children Display, and the Page Tree Editor.

Take this page tree for example:

Now, placing the Children Display web part on the Testspace Migration page shows all of its four children:

The hierarchy information for the Children Display web part is the same as for the page tree and breadcrumb.

The Children Display web part makes it super easy to navigate to child pages.

This three-minute video shows how it works:

Optional Usage Metrics Recording

How much value are your users getting from WikiPakk?

I understand that this question needs to be addressed to allocate budget for a solution like WikiPakk.

WikiPakk now has optional telemetry built in that can be switched on to record usage metrics. It’s turned off by default and needs to be explicitly configured.

When turned on, all data stays within your SharePoint tenant. And again, by default it’s off.

This eight-minute video shows how to set it up:

The video above walks you through:

setting up a central site to store the data
pointing WikiPakk to this central location
creating the SharePoint list that is used to store the data
watching recorded usage events

Note

Your central site might already have been configured if you chose to store your WikiPakk license key there.

Please check the WikiPakk Telemetry article for additional permission-related steps you need to take: How to Set Up Telemetry.

Addressed Issues

The breadcrumb sometimes showed the previous page after navigating. This has been addressed.

The page tree editor web part sometimes failed to save a drag operation. This has been fixed.

Furthermore, the breadcrumb bar was shown when embedding a page using the Embed web part. This has been addressed. When the embed URL contains the IsDlg or env query parameters the breadcrumb bar now stays hidden.
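The rule can be read as a simple check on the page URL. The Python sketch below is only an illustration of that rule, not WikiPakk code. The function name breadcrumb_hidden, the sample URLs, and the case-insensitive comparison of parameter names are assumptions.

```python
from urllib.parse import parse_qs, urlsplit

# Parameter names from the release note, lower-cased for comparison.
# Matching case-insensitively is an assumption of this sketch.
EMBED_PARAMS = {"isdlg", "env"}


def breadcrumb_hidden(url):
    """True if the URL carries one of the query parameters that mark an embedded page."""
    query = parse_qs(urlsplit(url).query, keep_blank_values=True)
    return any(name.lower() in EMBED_PARAMS for name in query)


print(breadcrumb_hidden("https://contoso.sharepoint.com/sites/wiki/SitePages/Home.aspx?env=Embedded"))
print(breadcrumb_hidden("https://contoso.sharepoint.com/sites/wiki/SitePages/Home.aspx"))
```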

Other Changes

You can now check for updates by clicking the Check for new version link in the about dialog:

Both the Children Display and Page Tree Editor web part got a loading indicator providing better visual feedback than before:

How to Update

The update is available from Microsoft AppSource.

This article shows how to update: Update WikiPakk.

After the update, there should now be two WikiPakk web parts available:

Children Display (WikiPakk)
Page Tree Editor (WikiPakk)

Staged Deployment

In case of a staged deployment you will have to update the WikiPakk solution in both the site collection app catalog of your staging site, and the tenant app catalog.

If you don’t see the two web parts, then you still need to update to the latest version. Refer to Check the Current and Available Version of WikiPakk for details on how to check which version you are on.

How to Export a Link Map

This blog post shows how to export a file that maps Confluence links to SharePoint links.

By Heinrich Ulbricht | Monday, March 17, 2025

When migrating pages from Confluence to SharePoint WikiTraccs makes sure to transform links as well.

Links between Confluence pages will become links between SharePoint pages.

When you change Confluence links in external systems to SharePoint links it might be handy to have a link mapping file that - for each old Confluence page link - shows the new SharePoint page link.

This post shows how you can generate such a link mapping file using a PowerShell script.

Note about Scripting

PowerShell scripting is out of scope for WikiTraccs, but at the same time a part of most migration projects. Scripts of any sort can be used to automate tasks.

Feel free to take the library script that is presented in this post and adapt it to your needs. Please understand that I cannot support custom script development and that you’ll need somebody who can run, modify, and extend PowerShell scripts.

Outline

You’ll use a PowerShell script to create the link mapping file. A sample PowerShell script is available in the library.

Running the PowerShell script requires PowerShell 7 to be installed. Furthermore, the PnP.PowerShell module must be installed.

The PowerShell script connects to SharePoint Online and needs an Entra ID Application Registration Client ID for that. You already have such a client ID if you are migrating pages with WikiTraccs. You can use the same client ID that you enter in the blue WikiTraccs window.

Running the Library PowerShell Script to Create The Link Mapping

In a minute, you’ll run a script that processes the metadata of all migrated pages in a SharePoint site collection and calculates the original Confluence links from this metadata.

Download the Wiki Transformation Project library as a zip file:

Unzip the file you downloaded. (Note: You could also use Git to clone the repository.)

Open scripts/post-migration/create-link-mapping/CreateLinkMapping.ps1 and adjust the parameters at the top of the script, mainly the client ID and the site collection URL.

Note

I recommend using Visual Studio Code and the PowerShell extension to run PowerShell scripts. This allows for easy configuration, debugging, and modification.

Run the PowerShell script.

A new browser tab will open, prompting you to authenticate with Microsoft 365. This is required for the PowerShell script to read page metadata from the Site Pages library.

After authenticating, the script should show console output for each page it processes, which could look like the following image:

The link mapping result is written to a text file called linkmapping.csv, which is stored in the current directory.

The content looks like this:

"SharePointPageLink","PageTitleLink","PageTitleLinkAlt","PageIdLink","PageTinyLink"
"/sites/migtarget1/SitePages/TM-Page-S1---Code-Snippet-106070257.aspx","http://localhost:8090/wiki/display/TM/Page+S1+-+Code+Snippet","http://localhost:8090/wiki/pages/viewpage.action?spaceKey=TM&title=Page S1 - Code Snippet","http://localhost:8090/wiki/pages/viewpage.action?pageId=106070257","http://localhost:8090/wiki/x/8YBSBg"
"/sites/migtarget1/SitePages/TM-Page-S6---Table-Cell-Colors-110464825.aspx","http://localhost:8090/wiki/display/TM/Page+S6+-+Table+Cell+Colors","http://localhost:8090/wiki/pages/viewpage.action?spaceKey=TM&title=Page S6 - Table Cell Colors","http://localhost:8090/wiki/pages/viewpage.action?pageId=110464825","http://localhost:8090/wiki/x/OY_VBg"
"/sites/migtarget1/SitePages/TM-Page-S2---Page-Tree-183861249.aspx","http://localhost:8090/wiki/display/TM/Page+S2+-+Page+Tree","http://localhost:8090/wiki/pages/viewpage.action?spaceKey=TM&title=Page S2 - Page Tree","http://localhost:8090/wiki/pages/viewpage.action?pageId=183861249","http://localhost:8090/wiki/x/AYD1Cg"

Using the Confluence page title, page ID, and tiny link (all migrated by WikiTraccs), the script calculates possible Confluence links, as those link types are known and follow a common scheme.

The script creates the most common Confluence (on-prem) link types: title link, pageId link, and tiny link.
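The link scheme can be read off the sample rows above. The following Python sketch rebuilds the four Confluence link columns for the first sample row. It is not the library script (which is PowerShell); the function name confluence_links is made up, and using quote_plus for the title is an assumption that matches the sample but may not cover every special character the way Confluence does.

```python
from urllib.parse import quote_plus


def confluence_links(base_url, space_key, title, page_id, tiny_key):
    """Build the four Confluence link variants that appear as columns in linkmapping.csv."""
    return {
        # Title link: spaces become '+' (assumption: quote_plus is close enough for other characters).
        "PageTitleLink": f"{base_url}/display/{space_key}/{quote_plus(title)}",
        # Alternative title link: the sample rows carry the title unencoded.
        "PageTitleLinkAlt": f"{base_url}/pages/viewpage.action?spaceKey={space_key}&title={title}",
        "PageIdLink": f"{base_url}/pages/viewpage.action?pageId={page_id}",
        "PageTinyLink": f"{base_url}/x/{tiny_key}",
    }


links = confluence_links(
    "http://localhost:8090/wiki", "TM", "Page S1 - Code Snippet", 106070257, "8YBSBg"
)
for column, link in links.items():
    print(column, link)
```

All inputs (space key, title, page ID, tiny link key) are taken as given here, in line with the post's statement that WikiTraccs migrates them as page metadata.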

Note

Confluence Cloud introduced new link types; those are not yet covered. Feel free to share the extended script if you add those.

The script connects to a single site collection at a time. Reconfigure and run this for every site collection you need the link map for. Or extend the script to take care of that and merge the mapping files.

Wrap

In this blog post we looked at how we can use migrated page metadata and a PowerShell script to create a mapping file that maps Confluence page links to the new SharePoint page links.

This can be helpful, for example, when searching and translating Confluence links in third-party systems.
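As a rough idea of how such a translation could work, the Python sketch below loads a mapping in the linkmapping.csv format and replaces old Confluence links in a piece of text. The function names load_link_map and translate_links and the sample note are made up. Plain string replacement is an assumption that will need hardening for real content, for example when one link is a prefix of another.

```python
import csv
import io

OLD_LINK_COLUMNS = ["PageTitleLink", "PageTitleLinkAlt", "PageIdLink", "PageTinyLink"]


def load_link_map(csv_file):
    """Map every known Confluence link variant to its SharePoint page link."""
    link_map = {}
    for row in csv.DictReader(csv_file):
        for column in OLD_LINK_COLUMNS:
            if row.get(column):
                link_map[row[column]] = row["SharePointPageLink"]
    return link_map


def translate_links(text, link_map):
    """Replace old links in text, longest first so a short link never clobbers a longer one."""
    for old in sorted(link_map, key=len, reverse=True):
        text = text.replace(old, link_map[old])
    return text


# In-memory stand-in for linkmapping.csv, using one row from the sample above.
sample = io.StringIO(
    '"SharePointPageLink","PageTitleLink","PageTitleLinkAlt","PageIdLink","PageTinyLink"\n'
    '"/sites/migtarget1/SitePages/TM-Page-S2---Page-Tree-183861249.aspx",'
    '"http://localhost:8090/wiki/display/TM/Page+S2+-+Page+Tree",'
    '"http://localhost:8090/wiki/pages/viewpage.action?spaceKey=TM&title=Page S2 - Page Tree",'
    '"http://localhost:8090/wiki/pages/viewpage.action?pageId=183861249",'
    '"http://localhost:8090/wiki/x/AYD1Cg"\n'
)
link_map = load_link_map(sample)
note = "See http://localhost:8090/wiki/x/AYD1Cg for the page tree."
print(translate_links(note, link_map))
```

To merge the mapping files of several site collections, call load_link_map once per file and combine the resulting dictionaries. Note that the SharePoint links in the file are server-relative, so the tenant host name still has to be prepended.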

Sharing Content Across SharePoint Pages Is Impossible

In this post we attempt to share content across multiple pages using SharePoint Online’s out-of-the-box features. It’s just not possible.

By Heinrich Ulbricht | Saturday, March 08, 2025

OK, technically speaking, sharing content across pages is possible; however, it’s neither usable, nor does it look good.

Let’s quickly look at the use case, then try different approaches.

Use Case

Sometimes the same content needs to be shown on multiple pages.

Instead of copying the same content into multiple pages, we want to have one place to author the content.

In Confluence this is easy using different excerpt and include macros. You can embed whole pages into other pages, or excerpts of pages into other pages.

This is what we need in SharePoint as well, at least when coming from and knowing the capabilities of Confluence.

Let’s do it.

Embedding a List Item with Rich Text Content

This approach seemed like a really good one.

We can store shared content snippets in a SharePoint list. SharePoint lists support rich text columns that can contain formatted content.

In modern SharePoint pages, we can then use the Embed web part to embed the list item we want to show.

Technically, this works; but here is how it looks:

Note that the content we want to embed is just the text “Item content”, nothing else.

Furthermore, the web part is way too high, the formatting and positioning are off, and there are user interface elements that we don’t need.

Here’s the URL that was used to embed the item:

https://contoso.sharepoint.com/sites/2025-01-embedding/Lists/Excerpts2/DispForm.aspx?ID=1&Env=embedded

So this is no good.

I also played with JSON-based list formatting but all to no avail. If you know a way to embed a single field of a list item to a SharePoint page, without pulling in a clunky UI around that, let me know.

Embedding Another Page

Maybe we could store shared snippets as separate pages. A little heavyweight, but maybe this works?

Let’s see how that looks, again using the Embed web part to embed another page:

Nearly the same result as with the list item.

The web part doesn’t adapt to the page content’s size, formatting is off, user interface elements are shown that we don’t need.

Even worse - for the embedded page to load, you have to hover the mouse cursor over the web part. Until you do that, the web part looks like this:

No good.

Embedding Another File

Let’s go another route and put our shared content into a Markdown file.

We can embed the Markdown file using the Embed web part as well. The result is comparable to embedding a page, and the user also needs to hover the mouse over the web part for it to show the actual file content.

There is one alternative to the Embed web part, though.

Using the File and Media web part, it is possible to make the file content show immediately, without having to hover over the web part.

Here is the File and Media web part showing Markdown content from a file:

And again, the web part is way too high and there is no way to adjust that.

Just no good.

Now I’m out of ideas.

Wrap

In SharePoint Online, using modern pages and without using third-party tools, there doesn’t seem to be a way to share content across multiple pages.

This is disappointing since it is the easiest thing in Confluence.

A migration tool like WikiTraccs thus has only two options when migrating shared content from Confluence to SharePoint: copy content to multiple pages, or link to the source page. Both approaches have drawbacks, so it would be nice to see SharePoint ramping up its wiki game here.

How Nested Confluence Macros Are a Migration Challenge

This post describes how nested macros are a challenge for Confluence to SharePoint migrations.

By Heinrich Ulbricht | Tuesday, March 04, 2025

We’ll look at how macros are nested, if and how that can be done in SharePoint as well, and how WikiTraccs approaches this during a migration.

How to Nest Macros in Confluence?

Let’s look at some examples.

If macros have a body, it is often possible to nest other macros in that body.

Here we have a Warning macro nested in an Info macro:

Edit Mode

View Mode

Here are Code Block macros nested in a table:

Edit Mode

View Mode

Here’s a more complex setup using the Column, Section, and Panel macros to put text and Code Block macros in a column layout:

Edit Mode

View Mode

How to Nest Web Parts in SharePoint Online?

It’s not possible.

In SharePoint Online, on modern SharePoint pages, you cannot nest web parts.

SharePoint web parts are put onto the page one after another; here a Text web part, followed by a Code Snippet web part:

It is technically impossible to put the Code Snippet web part into the Text web part.

This is a huge restriction compared to what you can do with Confluence macros.

What Does Missing Nesting in SharePoint Mean for Macro Migrations?

When trying to make nested macros compatible with SharePoint, here are the workarounds that can be applied:

Converting nested macros to text so everything fits into a Text web part
Moving nested macro content to another web part, e.g. moving the code macro to a separate Code Snippet web part

SharePoint also knows page sections, so there might be a chance one can get creative with those. But they are pretty heavyweight and cannot be nested either.

How Does WikiTraccs Handle Nested Macros?

Let’s see the two de-nesting approaches in action with this Info macro that contains a Code Block macro:

Code Block Nested Inside Info Macro

View Mode

When encountering this in a migration, WikiTraccs does the following:

Convert the Info macro to a table in the SharePoint Text web part to make it look like in Confluence
Create a Code Snippet web part below the Text web part, move the code there
Create a note where the Code Block macro used to be about the de-nesting
Create yet another Text web part right above the Code Snippet web part marking the moved code snippet
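The steps above can be sketched in a few lines of Python. This is a toy model, not how WikiTraccs is implemented: the dictionary-based page model, the web part type names, the note texts, and the function name denest are all assumptions made for illustration.

```python
# Hypothetical page model: a macro is a dict with a name and a list of children;
# plain strings are text. None of these names come from WikiTraccs.

def denest(info_macro):
    """Flatten an Info macro that contains Code Block macros into a list of web parts."""
    info_lines, moved_code = [], []
    for child in info_macro["children"]:
        if isinstance(child, dict) and child["name"] == "code":
            # Move the code out and leave a note where the macro used to be.
            moved_code.append(child["body"])
            info_lines.append(f"[Code block {len(moved_code)} was moved below this web part]")
        else:
            info_lines.append(child)

    # First web part: the Info macro rendered as a table inside a Text web part.
    web_parts = [{"type": "Text", "content": {"info_table": info_lines}}]
    for number, code in enumerate(moved_code, start=1):
        # A marker Text web part right above each Code Snippet web part.
        web_parts.append({"type": "Text", "content": f"Moved code block {number}:"})
        web_parts.append({"type": "CodeSnippet", "content": code})
    return web_parts


page = {"name": "info", "children": ["Run this first:", {"name": "code", "body": "echo hello"}]}
for part in denest(page):
    print(part["type"], "->", part["content"])
```

One nested macro turns into three web parts, which matches the three web parts described for the migrated page.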

The result looks like this (in SharePoint page edit mode):

Note: The dotted lines highlight the three involved web parts; those are not normally there in SharePoint and have been added for emphasis.

The image above also highlights another challenge when splitting content into multiple SharePoint web parts. There is a pretty large gap between web parts that can look awkward. But there is nothing that can be done about that.

Wrap

Nested Confluence macros, but also macros in tables and those that are used inline, pose challenges for the migration.

There is only so much a migration tool like WikiTraccs can do, given the limitations that SharePoint Online has.

For the future, I hope and wish that Microsoft further expands the capabilities of SharePoint pages and web parts. I know that the third-party component backing the SharePoint text web part supports something like macros and I’d really like to see the Expand macro implemented in SharePoint.

M365 Authentication, API Requests, and Blocked Endpoints

This post dives into API requests related to Microsoft 365 authentication, what happens when those are blocked, and how to test for that.

By Heinrich Ulbricht | Monday, February 17, 2025

A recent customer support case required deeper insights into the API requests originating from WikiTraccs. A list of endpoints was required to whitelist them at the proxy level.

While the WikiTraccs Endpoint Reference already covers endpoints, it wasn’t detailed enough.

How to get more details?

I’ll describe how to use Microsoft’s Dev Proxy tool to get a list of endpoints.

Using Dev Proxy to Log API Requests

Dev Proxy is a small command line tool that creates a local proxy that can be used to log and change API requests of applications. It’s provided by Microsoft and free to use.

We’ll use Dev Proxy to log all API requests that WikiTraccs makes.

Note

Please refer to the Dev Proxy documentation on how to install it.

Once installed, we configure it. Configuration is done via a JSON file.

The following configuration instructs Dev Proxy to log all requests to the console:

{
    "$schema": "https://raw.githubusercontent.com/dotnet/dev-proxy/main/schemas/v0.24.0/rc.schema.json",
    "plugins": [
        {
            "name": "UrlDiscoveryPlugin",
            "enabled": true,
            "pluginPath": "~appFolder/plugins/dev-proxy-plugins.dll",
            "urlsToWatch": [
                "https://*/*"
            ]
        }
    ],
    "logLevel": "information",
    "newVersionNotification": "stable",
    "showSkipMessages": true,
    "showTimestamps": true
}

Save the text above to devproxy.json.

Now we start Dev Proxy and instruct it to monitor WikiTraccs-related processes. Run the following command in a Terminal:

devproxy --config-file "C:\path\to\file\here\devproxy.json" --watch-process-names WikiTraccs.GUI WikiTraccs.Console conhost chromedriver chrome

Note

The --watch-process-names WikiTraccs.GUI WikiTraccs.Console conhost chromedriver chrome parameter restricts Dev Proxy’s operation to those processes. Omit the parameter to cover all applications running at the moment.

Dev Proxy will now log all API requests done by WikiTraccs or the Chrome browser.

API Request for Authenticating with Microsoft 365

Since the customer that triggered this investigation reported problems when authenticating with Microsoft 365, let’s have a look at the API calls that go over the wire when doing that.

When hitting the Test SharePoint Connection button in the blue WikiTraccs.GUI window, the following endpoints are called:

https://login.microsoftonline.com/common/discovery/instance?api-version=1.1&authorization_endpoint=https://login.microsoftonline.com/421cd8a4-daf6-434c-8eae-a685c9af1808/oauth2/v2.0/authorize
https://login.microsoftonline.com/421cd8a4-daf6-434c-8eae-a685c9af1808/oauth2/v2.0/authorize?scope=https://contoso.sharepoint.com/.default+openid+profile+offline_access&response_type=code&client_id=b05e893b-866e-40d3-be10-75e44e5c38c2&redirect_uri=http://localhost:64189&client-request-id=bbc47ed0-c9e4-44e5-b1df-72c6589892e8&x-client-SKU=MSAL.NetCore&x-client-Ver=4.61.3.0&x-client-OS=Microsoft+Windows+10.0.22000&prompt=select_account&code_challenge=snip&code_challenge_method=S256&state=snip&client_info=1
https://login.microsoftonline.com/421cd8a4-daf6-434c-8eae-a685c9af1808/reprocess?ctx=snip&sessionid=a11f9fda-ae88-4a93-8d8b-68a3dd193e7b
https://login.microsoftonline.com/421cd8a4-daf6-434c-8eae-a685c9af1808/oauth2/v2.0/token
https://contoso.sharepoint.com/sites/testing/_api/Web?$select=Id,Url,RegionalSettings/*,RegionalSettings/DateFormat&$expand=RegionalSettings,RegionalSettings/TimeZone
… more calls of the SharePoint API

Note that

421cd8a4-daf6-434c-8eae-a685c9af1808 is the SharePoint tenant ID
b05e893b-866e-40d3-be10-75e44e5c38c2 is the Entra ID application client ID
https://contoso.sharepoint.com is the SharePoint tenant URL

Unblocking above endpoints should make the authentication succeed.
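Proxy rules are often defined per host rather than per full URL. The Python sketch below condenses a list of logged request URLs into the distinct hosts. The function name hosts_to_allow is made up, the URL list is a shortened version of the one above, and whether your proxy filters by host or by path is something to check with whoever operates it.

```python
from urllib.parse import urlsplit


def hosts_to_allow(urls):
    """Return the distinct scheme://host pairs found in a list of logged request URLs."""
    return sorted({f"{urlsplit(u).scheme}://{urlsplit(u).hostname}" for u in urls})


# Shortened versions of the URLs listed above.
logged = [
    "https://login.microsoftonline.com/common/discovery/instance?api-version=1.1",
    "https://login.microsoftonline.com/421cd8a4-daf6-434c-8eae-a685c9af1808/oauth2/v2.0/token",
    "https://contoso.sharepoint.com/sites/testing/_api/Web?$select=Id,Url",
]
print(hosts_to_allow(logged))
```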

Note

You might see calls to Google-related endpoints like https://accounts.google.com/ListAccounts - those are done by Chrome. They are not required by WikiTraccs and may safely be blocked, as long as Chrome keeps working.

Simulating Blocked Connections

Keep reading if you want to know how to simulate slow network connections or incomplete proxy configuration.

This is one of the value propositions of Dev Proxy: you can see how applications behave in unexpected circumstances.

I’d like to emulate a proxy that delays responses of certain endpoints related to Microsoft 365 authentication.

Let’s extend the configuration file for Dev Proxy as follows:

{
    "$schema": "https://raw.githubusercontent.com/dotnet/dev-proxy/main/schemas/v0.24.0/rc.schema.json",
    "plugins": [
        {
            "name": "UrlDiscoveryPlugin",
            "enabled": true,
            "pluginPath": "~appFolder/plugins/dev-proxy-plugins.dll",
            "urlsToWatch": [
                "https://*/*"
            ]
        },
        {
            "name": "LatencyPlugin",
            "enabled": true,
            "pluginPath": "~appFolder/plugins/dev-proxy-plugins.dll",
            "configSection": "latencyPlugin1",
            "urlsToWatch": [
                "https://login.microsoftonline.com/common/discovery/instance*"
            ]
        },
        {
            "name": "LatencyPlugin",
            "enabled": true,
            "pluginPath": "~appFolder/plugins/dev-proxy-plugins.dll",
            "configSection": "latencyPlugin2",
            "urlsToWatch": [
                "https://login.microsoftonline.com/421cd8a4-daf6-434c-8eae-a685c9af1808/oauth2/v2.0/token*"
            ]
        }
    ],
    "latencyPlugin1": {
        "minMs": 180000,
        "maxMs": 180000
    },
    "latencyPlugin2": {
        "minMs": 86400000,
        "maxMs": 86400000
    },
    "logLevel": "information",
    "newVersionNotification": "stable",
    "showSkipMessages": true,
    "showTimestamps": true
}

This configuration file adds the LatencyPlugin that slows down responses for endpoints. It is added two times, for two different endpoints, to set two different delays.

Endpoint one is https://login.microsoftonline.com/common/discovery/instance - this gets a response delay of 3 minutes (180000 ms).

Endpoint two is https://login.microsoftonline.com/421cd8a4-daf6-434c-8eae-a685c9af1808/oauth2/v2.0/token - this gets a response delay of 24 hours (86400000 ms), which is equivalent to the proxy stalling forever.

Running Dev Proxy with above configuration has the following effects on WikiTraccs, both when testing the SharePoint connection and when starting a migration:

the browser tab for logging in to SharePoint Online will only appear after a 3-minute delay
after (seemingly) successfully authenticating with SharePoint Online, WikiTraccs will wait for 100 seconds and run into a timeout, never completing the authentication

While waiting, WikiTraccs will appear stuck, not giving any feedback at all. This is something that needs to be addressed in a future update.
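A script can avoid appearing stuck by giving each request an explicit time budget and reporting the outcome. The following Python sketch shows the idea. It is not what WikiTraccs does: the function name probe, the 10-second default, and the three result labels are assumptions. The example call uses a fake opener that simulates a stalled proxy, so it runs without network access.

```python
import socket
import urllib.error
import urllib.request


def probe(url, timeout_seconds=10, opener=urllib.request.urlopen):
    """Classify an endpoint as 'reachable', 'timeout', or 'error' within a fixed time budget."""
    try:
        with opener(url, timeout=timeout_seconds):
            return "reachable"
    except urllib.error.HTTPError:
        # The server answered, even if with 4xx/5xx, so the request got through.
        return "reachable"
    except (socket.timeout, TimeoutError):
        return "timeout"
    except (urllib.error.URLError, OSError) as error:
        # urlopen can wrap connect timeouts in URLError; unwrap to tell them apart.
        reason = getattr(error, "reason", None)
        return "timeout" if isinstance(reason, (socket.timeout, TimeoutError)) else "error"


def stalled(url, timeout):
    """Fake opener that behaves like a proxy that never answers in time."""
    raise TimeoutError("simulated stalled proxy")


print(probe("https://login.microsoftonline.com/common/discovery/instance", opener=stalled))
```

Leaving out the opener argument makes probe use urllib.request.urlopen and hit the real endpoint, for example while Dev Proxy is running with the latency configuration.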

PnP.PowerShell

A quick aside about PnP.PowerShell which you might use in Microsoft 365-related projects.

Note

If you are not using PnP.PowerShell yet, you should! It’s great for scripting everything related to SharePoint Online and Microsoft 365 in general.

Under the hood, WikiTraccs uses the same authentication mechanism as PnP.PowerShell.

This is why the Dev Proxy configuration from the last section will also make PnP.PowerShell fail when connecting.

Here’s the PnP.PowerShell command that corresponds to what WikiTraccs does when connecting to SharePoint Online:

Connect-PnPOnline -Url https://contoso.sharepoint.com/sites/testing -LaunchBrowser -ClientId b05e893b-866e-40d3-be10-75e44e5c38c2 -Interactive -Tenant 421cd8a4-daf6-434c-8eae-a685c9af1808

In this command:

-Url https://contoso.sharepoint.com/sites/testing is the SharePoint site to connect to
-LaunchBrowser tells it to open a tab in a browser where you can re-use an existing login
-ClientId b05e893b-866e-40d3-be10-75e44e5c38c2 is the Entra ID application client ID; this can be the same you use for WikiTraccs
-Interactive says that you’ll sign in with account credentials
-Tenant 421cd8a4-daf6-434c-8eae-a685c9af1808 is the tenant ID to use; you can copy that from the blue WikiTraccs.GUI window

So, if you ever want to test how your PowerShell scripts behave in “special” circumstances - you now know how.

Wrap

In this post we looked at the API endpoints that WikiTraccs uses when logging in to Microsoft 365 and how to assemble this list using Microsoft’s Dev Proxy tool.

As an aside, we also looked at how to use Dev Proxy to simulate “special” network conditions and hinted at how to test PowerShell scripts under similar conditions.

Migrating Confluence Cloud Whiteboards

This post provides information about whiteboard migrations and how they are expected to look.

By Heinrich Ulbricht | Monday, February 17, 2025

Note

This post is only relevant for Confluence Cloud.

Whiteboard migration can be enabled in the WikiTraccs settings:

This will export Confluence whiteboards as images to SharePoint.

It’s a Gray Setting which means it might not be fully supported by a vendor.

Whiteboard Export is Not Supported by Atlassian

Let’s get that out of the way. Exporting whiteboards in a programmatic, automated manner is not possible at the moment.

The only way to export a whiteboard as image is to click the right buttons in the browser.

Here is one of the many whiteboard-related issues in the Atlassian Jira: Ability to export/import whiteboards (CONFCLOUD-76634). Please vote.

So…

How Does WikiTraccs Export Whiteboards?

WikiTraccs can export whiteboards as images.

It does it exactly like any user would do: by clicking the right buttons.

That means, WikiTraccs knows how the current Confluence Cloud user interface looks, how to find the right buttons, and how to click them, to start the image export.

After clicking the export button, it grabs the exported image from the download folder and that’s it.

Sounds rickety? It kind of is.

The export works, as long as the user interface does not change.

It can break if Atlassian changes the user interface of Confluence Cloud.

This video shows what it looks like when WikiTraccs exports a whiteboard (32 seconds):

The following happens in this video:

WikiTraccs discovers a whiteboard to migrate
WikiTraccs opens a remote-controlled browser and navigates to the whiteboard (note: I had to move the browser window over from the second monitor)
WikiTraccs clicks all the right buttons to export the whiteboard
Chrome saves the downloaded image as Whiteboard.png
WikiTraccs migrates Whiteboard.png to SharePoint, as a page attachment

Everything that happens inside the browser is done by WikiTraccs; there was no user interaction (apart from moving the browser into view).

The resulting SharePoint page shows the exported image:

(Note: the whiteboard image can be huge, depending on the size of the original whiteboard. Clicking the image in SharePoint will expand it. It can also be downloaded to view it in full size.)

What Could Go Wrong?

As long as Atlassian doesn’t change the Confluence Cloud UI in a way that changes the buttons needed for the export, the process is pretty stable (although quite slow). So far, it has broken about once every three months.

What sometimes happens is that remote-controlling the Chrome browser window fails for some reason. Might be a Chrome update, might be some Windows shenanigans. If that is the case, close WikiTraccs and all browser windows and restart the migration.

Note

You can restart the migration at any time. When starting a migration, WikiTraccs always checks what’s there in Confluence, what’s there in SharePoint, and it will migrate what is missing in SharePoint.
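Conceptually, this check is a set difference. The post does not describe how WikiTraccs compares both sides, so the Python sketch below is only a mental model; comparing by page ID and the function name pages_to_migrate are assumptions.

```python
def pages_to_migrate(confluence_page_ids, sharepoint_page_ids):
    """Pages present in Confluence but not yet in SharePoint, in Confluence order."""
    already_there = set(sharepoint_page_ids)
    return [page_id for page_id in confluence_page_ids if page_id not in already_there]


# Three pages exist in Confluence, one of them was already migrated.
print(pages_to_migrate([106070257, 110464825, 183861249], [106070257]))
```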

What also sometimes happens is that Atlassian shows an advertisement for a new feature in the browser. Often, this ad is an overlay that is shown until you manually dismiss it by clicking the X or Cancel. This might interfere with the export.

If there is an overlay or dialog being shown that blocks the export, simply close it. The whiteboard migration might continue, but restarting might be necessary as well.

If the export breaks, WikiTraccs probably will require an update.

Under the Hood

To find the right button to click in the Confluence Cloud user interface, WikiTraccs uses CSS and XPath selectors.

You find those in a TOML file that is located at WikiTraccs.Console\Templates\BrowserAutomation\export-whiteboard.toml.

It contains selectors like this one:

[selectors.three-dot-menu]
description = "Three-dot menu in the upper right corner of a page"
language = "css"
query = "#more-actions-trigger"

This selector describes how to find the three-dot menu button on a Confluence page.

There is a chance that, if whiteboard export breaks due to user interface changes, adjusting those selectors can fix the issue.

This approach to user interface automation is often used when programmatically testing user interfaces of web applications. WikiTraccs makes use of that approach to export whiteboards.

Authentication

Since WikiTraccs needs to open the whiteboard in the browser, this approach only works with Interactive Login where an account is logged in to Confluence to look at the whiteboard. It won’t work with Personal Access Tokens.

Wrap

In this blog post we looked at the current state of whiteboard exportability in Confluence Cloud, at how WikiTraccs does it and how it could break.

We all agree that this is a temporary solution and we hope that Atlassian provides an official export API in the near future (although I personally don’t believe that).

Harnessing the New SharePoint Page Format

This post highlights the changes Microsoft made to SharePoint pages in 2024 and how WikiTraccs adapts to them.

By Heinrich Ulbricht | Wednesday, February 05, 2025

In 2024, Microsoft upgraded the internal page format for SharePoint modern pages. The new format comes with an updated page editor and new formatting capabilities.

This change rolled out over the course of 2024 and now all new SharePoint pages use the new format.

But what about older pages?

How Microsoft Upgrades Older Pages

When a SharePoint page is edited in the browser, a (Microsoft-owned) script processes the page and - if required - upgrades it to the new page format.

This page upgrade happens only when a page is edited in the browser, so you can see the result immediately. During the upgrade, the styling of page elements might change.

What Microsoft doesn’t do is bulk-upgrading pages in the SharePoint backend. The upgrade only happens on a page-per-page basis when editing a page.

Here’s a sample of how the page upgrade looks:

This is what happens in the video:

a SharePoint page is open in the browser
the user clicks the Edit button to edit the page
the content of the text web part briefly disappears
the content of the text web part reappears, now upgraded

This video also illustrates one of the older (and now resolved) issues that could happen during page upgrade (the table ends up way too narrow).

Nowadays SharePoint also shows a progress message:

Page Upgrade Issues

There were several issues with the way Microsoft upgraded pages that were created in the pre-2024 format, e.g. tables looking awful after the upgrade (as can be seen in the video above), or lists that couldn’t contain images anymore. Most issues seem resolved.

One remaining issue affects larger pages with images.

When upgrading a SharePoint page, the upgrade script converts embedded images to a new format. Unfortunately, this process sometimes fails.

Note

This issue only affects the image display on pages. The actual image files (page attachments) are always present and available.

What has usually helped so far is to wait a bit and avoid scrolling while the page upgrade is in progress. Alternatively, trying a different browser or closing and reopening the browser before editing a long page can help (I suspect memory might be an issue during page upgrade).

Essentially, keep trying until it works.

Before each new try, always revert the page to the previous page version. Reverting to the previous version makes the images reappear. After that, the page can be edited again, and hopefully, the upgrade script will complete successfully.

A Microsoft support ticket might also help, but I’ve had to do this a few times with regard to SharePoint pages, and it’s usually a slow process with moderate success. So far repeated attempts always resolved the issue.

Note

This issue should be gone with the new page format.

What’s New in the New Pages?

In my view, the Text web part is the one that gets the most out of this page upgrade.

It gained the following new capabilities:

line height settings
more versatile spacing settings (space before, space after)
new color options
different styles for citations
… and more

Overall, the new pages appear visually more balanced and appealing than their older counterparts.

How Does WikiTraccs Handle Old and New Pages?

WikiTraccs up until version v1.23 creates pages in the older format.

Going forward, since the new SharePoint page format seems to have stabilized, WikiTraccs will create pages in the new format.

Creating pages in the new format is opt-in.

How to Opt-In to New Pages?

Use WikiTraccs release v1.24.0 or newer and make sure to UNcheck Create SharePoint pages with pre-2024 type in the WikiTraccs Settings:

While creating pages in the new format is currently an opt-in setting, it will be the default in a future WikiTraccs release.

For now, you have to explicitly choose to have new pages created.

Wrap

Creating SharePoint pages in the new page format is a big change for WikiTraccs and allows using the full set of features that those pages bring.

Please report any issues that you see with the new pages.

Splitting a Space

This post explains how to split a space and migrate the parts to different target sites.

By Heinrich Ulbricht | Friday, January 17, 2025

Assume the following use case where a space needs to be split into multiple parts:

“We have a Confluence space that contains information contributed by seven different teams. Each team’s content needs to be migrated to a separate SharePoint site.”

Let’s assume that all pages belonging to one team have a common parent page. That’s 7 parent pages for 7 teams.

How to migrate each of the 7 parent pages and all their descendants from Confluence to 7 different target sites in SharePoint?

With WikiTraccs, there are two approaches to selecting parts of a space for a migration:

the Content ID selector
the CQL selector

We’ll look at the Content ID selector in depth.

Use the Content ID Selector

With the Content ID selector you give WikiTraccs a list of page IDs to migrate.

Terminology note: What is a “page”?

How you assemble the list of page IDs is up to you. With Confluence Server and Data Center, a common approach is a database query; in the Cloud, other approaches are needed.

In this section we are looking at:

a sample tree to illustrate migrating a subtree of pages
how to assemble a list of page IDs from the Confluence database
how to properly format those page IDs
how to create a Content ID selector entry in the Confluence Space Inventory list
how the Space Inventory looks after configuration

One way to get page IDs is from the Confluence database using SQL queries.

Consider the following page tree. How to get all page IDs of the Special Content page and all its descendants (because we want to migrate those)?

First we need the starting point, the page ID of the Special Content page, which is the root of the subtree.

To get the page ID for any page, open the Page Information page:

Then look at the browser bar that should show something like this: https://contoso.com/confluence/pages/viewinfo.action?pageId=65604.

The subtree root page ID in our case is 65604.
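
If you collect root page IDs for several subtrees, reading the ID out of the address can be scripted. This is a small sketch using only the Python standard library; the function name page_id_from_url is hypothetical and the URL is the sample address from above.

```python
from urllib.parse import parse_qs, urlparse


def page_id_from_url(url: str) -> str:
    """Return the value of the pageId query parameter of a Confluence URL."""
    query = parse_qs(urlparse(url).query)
    values = query.get("pageId")
    if not values:
        raise ValueError(f"no pageId parameter found in: {url}")
    return values[0]


print(page_id_from_url(
    "https://contoso.com/confluence/pages/viewinfo.action?pageId=65604"
))
```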

Now we can run the following SQL query on the Confluence database to get the list of descendants:

Note

Replace 65604 in the following snippet with your subtree root page ID.

SELECT c.contentid,
       c.title
FROM content c
FULL OUTER JOIN confancestors ca ON ca.descendentid = c.contentid
WHERE ca.ancestorid = '65604'
    AND c.content_status = 'current'
    AND c.prevver IS NULL;

The result might look like this, but could also look different, depending on the tool that is being used:

WikiTraccs expects those IDs to be in a specific format, though. We use an adjusted SQL query to get the list of IDs in the expected format:

SELECT count(*), string_agg(contentId || ';#' || contenttype, ',') AS contentIdSelectorValue
FROM (
	SELECT c.contentid,
		   LOWER(contenttype) AS contenttype,
		   c.title
	FROM content c
	FULL OUTER JOIN confancestors ca ON ca.descendentid = c.contentid
	WHERE ca.ancestorid = '65604'
		AND c.content_status = 'current'
		AND c.prevver IS NULL
	ORDER BY contenttype, contentId
) subquery;

The result now looks different:

This result contains the number of pages (16) and the list of IDs, including their type, in the expected format:

65608;#page,65611;#page,65635;#page,65637;#page,65640;#page,65642;#page,65644;#page,65646;#page,65648;#page,65650;#page,65652;#page,1179651;#page,5472257;#page,5472259;#page,5472261;#page,17924097;#page
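
If the IDs come from somewhere other than the SQL query (for example a spreadsheet or an export), the same format can be assembled with a few lines of script. The following Python sketch is only an illustration; build_selector_value and its parameters are hypothetical names. It mirrors the ordering of the query (by type, then by ID) and optionally adds the subtree root, which is not part of the descendants returned by the query.

```python
def build_selector_value(items, root_id=None, root_type="page"):
    """Build 'id;#type,id;#type,...' from (content_id, content_type) pairs.

    root_id is optional; pass it to include the subtree root page itself.
    """
    entries = [(int(cid), ctype.lower()) for cid, ctype in items]
    if root_id is not None:
        entries.append((int(root_id), root_type.lower()))
    # Drop duplicates, then order by type and ID like the SQL query does.
    ordered = sorted(set(entries), key=lambda e: (e[1], e[0]))
    return ",".join(f"{cid};#{ctype}" for cid, ctype in ordered)


# Three of the descendant IDs from the result above, plus the root page.
descendants = [(65611, "PAGE"), (65608, "PAGE"), (65635, "PAGE")]
print(build_selector_value(descendants, root_id=65604))
```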

Note

Make sure to update the space inventory before proceeding by clicking the Update space inventory and WikiTraccs site button in the blue WikiTraccs window:

This makes sure that the space inventory list exists and that it contains space information you can build on.

This step is also included in the quick start guide, so if in doubt, refer to that: Getting started.

We can now create the content ID selector in the Confluence Space Inventory.

Click Add new item to invoke the respective dialog:

You need to fill the following fields:

Field	Meaning	Sample Value
Title	Not used, but it’s a mandatory field; enter anything	dummy
WT_In_CfSiteId	Confluence base address; look at the other entries in the space inventory list to see what to enter	https://contoso.com/confluence
WT_Setting_RequestTransformation	Check this box to migrate this selector	☑
WT_Setting_ContentSelectorValue	The list of IDs in the expected format	65608;#page,65611;#page,65635;#page,65637;#page,65640;#page,…
WT_Setting_TargetSiteRootUrl	SharePoint target site for migrated pages	https://contoso.sharepoint.com/sites/ContentIDSelectorDemoTargetONE

All other values can be left empty, as those are not required for the Content ID selector.

The space inventory list, after adding two Content ID selectors, might look like this:

Two things are important as well:

Don’t forget to add the ancestor page to the ID list of the selector as it is NOT part of the SQL result above.
Remove the entry for the space you are splitting from the space inventory. Otherwise link translation might get confused and link to the wrong target site, as pages would be included in two selectors at the same time.

Now start a migration.

WikiTraccs should migrate the pages indicated by each Content ID selector to the respective target site. Links will be transformed as well, based on the target site URL configuration.

Use the CQL Selector to Choose Pages for Migration

The CQL selector works similarly to the Content ID selector, but instead of giving WikiTraccs a list of page IDs you give it a CQL query - a Confluence search query.

Using a CQL selector works just like the Content ID selector:

create a new item in the space inventory list
enter field values as described in the last section; there’s only one difference
instead of the list of IDs you enter the CQL query into the WT_Setting_ContentSelectorValue field

Using a CQL query, you can refer to Confluence content by criteria you define. Let’s create a simple CQL query that refers to all pages that have the label migration set:

The CQL query that finds those pages is label = migration.

You should always test that your CQL query works, before using it as a selector for WikiTraccs. You’ll spot errors in the query much faster.

Testing CQL queries in Confluence is a bit cumbersome as you have to use the REST API. Assuming your Confluence base address is https://contoso.com/confluence, the address for testing the CQL query above is https://contoso.com/confluence/rest/api/content/search?cql=label = migration. Navigate to this address and the result should show a list of pages in JSON format:

Have a look at the CQL selector documentation to learn more details and read about some caveats associated with this selector type.
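
Browsers usually encode the spaces and the equals sign in such an address on their own. When testing from a script, or when pasting the address into a tool that does not do this, the query should be percent-encoded explicitly. A minimal sketch in Python, using only the standard library (cql_test_url is a hypothetical helper name):

```python
from urllib.parse import quote


def cql_test_url(base_address: str, cql: str) -> str:
    """Build the REST search address for a CQL query, percent-encoding the query."""
    base = base_address.rstrip("/")
    return f"{base}/rest/api/content/search?cql={quote(cql, safe='')}"


print(cql_test_url("https://contoso.com/confluence", "label = migration"))
```

The printed address can be opened in a browser that is logged in to Confluence, just like the unencoded one.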

Configuring the Page Tree in SharePoint

If you migrate a subtree of pages and also use the WikiPakk page tree, the two work well together.

After migrating a subtree, open the migrated root page. Then open the page tree panel and click Move to tree root.

The subtree root will now always be shown when opening the page tree panel.

Note that this only works when migrating a subtree as this will also migrate the page hierarchy within the subtree. Migrating a list of arbitrarily picked pages won’t result in a proper tree in SharePoint.

Testing WikiTraccs for Markdown

This post documents the current state of WikiTraccs for Markdown, highlights a sample repository, and describes how to test publishing.

By Heinrich Ulbricht | Thursday, January 09, 2025

This topic is part of WikiTraccs for Markdown and work in progress.

Join the waitlist and start publishing Markdown to SharePoint soon

The last blog post provided some background on how WikiTraccs for Markdown came to be.

This blog post shows how to publish Markdown files to SharePoint Online.

Sample Repository

We use a sample knowledge repository to test publishing Markdown to SharePoint Online. There is one available in our library: AI for Beginners Sample Repo.

This repository is a copy of one of Microsoft’s repositories about AI and contains lots of Markdown files, images, and folders:

This is not an ideal structure, but taking something “from the wild” seems like a good test. WikiTraccs will make educated guesses about IDs, parent-child relationships and so on.

For testing, download the whole library repository as zip file and extract it. Then use the ai-for-beginners-sample-repo folder as base address for your test.

Publishing Markdown to SharePoint Online

Starting with release v1.23.6, WikiTraccs supports file paths as “Confluence” base address.

Enter the path to the ai-for-beginners-sample-repo folder:

Note

Naming throughout WikiTraccs is still focused on Confluence as sole data source. This will be adjusted in a later release.

Make sure to activate the Skip Connection Checks setting as those are not required:

Then proceed as if you would migrate from Confluence:

Update the space inventory
Choose sources in the space inventory
Start the migration

WikiTraccs should now start publishing Markdown to SharePoint Online.

Limitations and Work Items

This is a non-exhaustive list of things that need to be done before WikiTraccs for Markdown is ready for production use:

auto-disable connection checks for Markdown source
don’t use the folder path as source “Site ID”, as this makes migrating from different migration machines impossible
optimize speed for larger repositories; currently it’s rather slow
the page hierarchy is not yet properly migrated for the sample repo
adjust the user interface to be “Confluence-free”
figure out where to put images and files that are referenced by multiple pages; images currently are handled as “external” and duplicates are created per referencing page
provide documentation on the expected source repository format and possible configuration options
figure out a good format for IDs (numbers? GUIDs?) and how to make maintenance easy (e.g. preventing duplicates)

And probably more.

Finding the Confluence Version

How to find the version of a Confluence instance or site.

By Heinrich Ulbricht | Wednesday, January 08, 2025

Time and again the question comes up: which Confluence version am I running on?

One way of finding the Confluence version is as easy as scrolling down on any Confluence page. The version might be written there:

If you don’t see any version information in the page footer you might be using a theme that hides that information.

You can always resort to the /rest/applinks/1.0/manifest endpoint that shows the Confluence version as well:

if your Confluence is available at https://confluence.contoso.com the address to navigate to would be https://confluence.contoso.com/rest/applinks/1.0/manifest
if your Confluence is available at https://contoso.com/confluence the address to navigate to would be https://contoso.com/confluence/rest/applinks/1.0/manifest

Navigate to the address in a browser where you are logged-in to Confluence and the page that opens shows the Confluence version:

Here the Confluence version would be 8.7.1.

This also works in Confluence Cloud. The version number for Confluence Cloud always starts with 1000.
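
The two rules from this post (append the manifest path to the base address, and treat a version starting with 1000 as Confluence Cloud) can be put into a short script. This is only a sketch; the function names are hypothetical and the Cloud version string used in the example is made up for illustration.

```python
def manifest_url(base_address: str) -> str:
    """Append the applinks manifest path to a Confluence base address."""
    return base_address.rstrip("/") + "/rest/applinks/1.0/manifest"


def looks_like_cloud(version: str) -> bool:
    """Return True if the first part of the version number is 1000."""
    return version.split(".")[0] == "1000"


print(manifest_url("https://contoso.com/confluence"))
print(looks_like_cloud("8.7.1"))     # version from the example above
print(looks_like_cloud("1000.0.0"))  # made-up Cloud-style version string
```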

WikiTraccs for Markdown

This post introduces WikiTraccs for Markdown.

By Heinrich Ulbricht | Friday, January 03, 2025

This topic is part of WikiTraccs for Markdown and work in progress.

Join the waitlist and start publishing Markdown to SharePoint soon

When demoing Confluence to SharePoint migrations with WikiTraccs, I sometimes get approached with something different in the context of a broader strategy shift.

Clients are consolidating their knowledge repositories to cut costs, reduce complexity, and to be AI-ready.

In-house CMS systems are being sundowned, Confluence content is being moved to SharePoint Online (but only for so long).

One goal is to break free of any specific system for documentation and knowledge management, while still having the choice to publish to any service to make information available to users.

Clients are switching to Markdown.

So far, WikiTraccs transforms Confluence pages to SharePoint pages.

Here’s the question that I get asked: Can WikiTraccs also transform Markdown to SharePoint pages?

Soon, the answer will be Yes.

Publish Markdown to SharePoint Online

WikiTraccs for Markdown will be your tool to publish from Markdown to SharePoint.

Many other (free) tools are available to create, edit, and publish Markdown. WikiTraccs will help you with publishing to SharePoint Online.

I’m currently gathering information on how a Markdown-based knowledge base repository can compete with third-party services like a Confluence Enterprise wiki. Read more about that here: Strategy Shift to Text Files for Knowledge Repositories.

Proof of Concept

WikiTraccs for Markdown is currently in a proof of concept phase.

It can already pick up Markdown files from the file system and publish those to SharePoint Online, including page attachments and images.

WikiTraccs can already publish this Microsoft AI starter repository to SharePoint: AI for Beginners.

Here’s a page from that GitHub repository, as shown on GitHub:

And here’s the SharePoint page published by WikiTraccs:

Both pages are based on this Markdown file: README.md.

The Proof of Concept already raised some questions, among those are:

What’s a common layout for the Markdown-based knowledge repository?
How many pages and files might such a repository contain?
How are attachments and pages related; how common is image-reuse?
Are page attachments still the way to go in SharePoint, or should attachments be stored in separate Document Libraries?
While Markdown files are perfect for being stored in a version control system, how about large attachment files?

Get in touch if you’d like to provide your thoughts or want to learn more.

Making SharePoint Tables Look Pretty

This post shows how to use colored and merged table cells in SharePoint Online tables, and how to enable this for WikiTraccs.

By Heinrich Ulbricht | Wednesday, December 18, 2024

Update

As of April 2025 you can merge table cells in the SharePoint page editor. Microsoft added support for that.

Update 2

As of May 2025 you can set the table cell background color and configure the border style.

So, all table formattings applied by WikiTraccs are finally also available in the browser-based SharePoint page editor.

You cannot create tables in SharePoint Online that have colored or merged cells.

(The blog post ~~could~~ should end here.)

Outdated

The following information is outdated.

But this is not entirely true. What is true is that, when creating a new SharePoint page, there are no tools that you could click in the text web part’s tool bar. There is ~~no “merge cells” button and~~ no color picker for cell backgrounds.

Nevertheless, it is possible to copy a table from Microsoft Word and paste it to a SharePoint page. Now, if that is officially supported (and I don’t know if it is), WikiTraccs could use the underlying mechanism as well.

This is why WikiTraccs introduces the Use non-standard table transformation setting, and we’ll look at why it’s named like that further down.

Copying tables from Word to SharePoint preserves styles

Take this table for example, created in Word Online:

This table can be copied to the clipboard:

And from the clipboard, the table can now be pasted into a SharePoint page, preserving its style:

Apparently copying and pasting a table from Word carries over cell colors and merged cells. And even more important: those styles stay in place when editing and saving the SharePoint page.

I had multiple clients reach out to me asking for this to be added to WikiTraccs.

WikiTraccs adds support for cell colors and merged cells

Note: this is available as of WikiTraccs v1.22.5.

WikiTraccs can now transform Confluence tables such that cell colors and merged cells are preserved.

~~This feature is opt-in, you have to activate it in the WikiTraccs settings.~~ Note: This feature is on by default starting with WikiTraccs v1.24.20.

Here’s how to de/activate the feature:

in the blue WikiTraccs.GUI window, click Settings in the menu bar to open the Settings dialog
in the Settings dialog, in the Migration tab, un/check the option Use non-standard table transformation

Is this supported by Microsoft?

I don’t know if those copied-over tables are officially supported, as end users cannot create them in the text web part editor. Copying and pasting seems to be the only way.

At the same time, I heard from clients that they use this functionality and also that this seemed to be advertised by Microsoft as a feature some time ago.

Why am I cautious?

I’m cautious because page content that cannot be created in the browser editor, by the user, by clicking buttons, might cease to work with a future update (by Microsoft) of the text web part. This might also affect how pages behave when being edited; styles might change or content even disappear.

Will that be the case in the future? I don’t know. I hope not.

Is there a way out if something happens in the future? There certainly is, by modifying page contents via PowerShell.

As long as there is this uncertainty, the setting is placed in the Gray Settings area of the Settings dialog, and it carries the non-standard hint in its name.

Visual comparison

Here’s a colorful Confluence table that already appeared in a blog post:

Table in Confluence

Here’s how that looks in SharePoint with the new setting enabled:

Table in SharePoint, now with colored cells

And here’s how the table would normally look, with the new setting disabled:

Table in SharePoint, new setting disabled

Another Confluence table that uses merged cells:

Table in Confluence

Here’s how that looks in SharePoint with the new setting enabled:

Table in SharePoint, now with merged cells

Here’s how the table would normally look, with the new setting disabled:

Table in SharePoint, new setting disabled

Note that in the image above the presence and the direction of cell merging is hinted at by arrows.

Overall, real merged cells are a lot more pleasant to look at.

Wrap

In this blog post we looked at a way to create tables with colored and merged cells in SharePoint Online modern pages, by copying and pasting styled tables from Word.

We also looked at a new WikiTraccs setting that can leverage this behavior.

There is no documentation by Microsoft (that I know of) that describes this as an official feature and words of caution have been written.

If somebody can find official documentation by Microsoft that this is an actual feature that is officially supported, please let me know.

How much time will a Confluence to SharePoint migration take?

This post looks into one of the most asked questions: how long will the migration take?

By Heinrich Ulbricht | Saturday, December 14, 2024

The migration time depends heavily on the source content in Confluence, and the network speed, both for downloads and uploads.

We’ll do some test migrations to have real-world data to look at.

Summary (TL;DR)

Here are the average migration times per page, taken from test migrations detailed further down this post:

07 seconds (~2h per 1000 pages) (note: few attachments)
11 seconds (~3h per 1000 pages)
12 seconds (~3h per 1000 pages)
12 seconds again
13 seconds (~4h per 1000 pages)
13 seconds again
15 seconds (~4h per 1000 pages)
20 seconds (~6h per 1000 pages)
35 seconds (~9h per 1000 pages) (note: lots of attachments)

Read further to get details.

How does the source content influence migration speed?

The following list shows migration activities that are always present and can take quite some time:

creating a page in SharePoint and setting up the metadata around it
uploading files to SharePoint Online (mainly on slow connections)
downloading files from Confluence (mainly on slow connections)

Here are some activities that also take time, but are not always needed, depending on the source content:

resolving Confluence links - WikiTraccs will check every link it finds to know where the link links to, to properly transform the link
resolving @-mentions - this involves looking up user information, writing this to the user mapping table, and trying to find the user in SharePoint
transforming Jira macros - this involves reaching out to Jira; Confluence’s Jira integration is rather slow

If you migrate pages with only text content and no attachments they’ll migrate in no time.

If you migrate pages with 500 attachments each, it might take minutes for each page to be migrated.

Using live migration metrics to get insights

Note

Those metrics are only available in Migrate content mode.

With release v1.22.1, WikiTraccs introduced live metrics that show how long migrating pages takes (those are also written to the log file). Those metrics also provide insights into which of the mentioned activities - downloading and uploading content, resolving links, etc. - happen, and how much time they take.

You can read more about those live metrics here: Live Progress Indicators.

Since the migration times depend on the source content, we need some source content to perform actual migrations, to get actual data to look at.

Thankfully, there are plenty of open Confluence sites available to run some measurements; some run by hobbyists, many by universities, some by non-profits and NGOs.

Testing migrations using open Confluence sites

We’ll pick some of those open Confluence sites, not based on the meaning of their content, but on the amount and type of their content. I’m not affiliated with any of those sites. They just provide a great way to test one thing or another.

Note: over time the accessibility of those sites will change. Some will close, some will move. The number of Confluence on-prem instances decreases, the number of Confluence cloud sites increases.

Note

“Open site” means a Confluence site on the internet that has anonymous access enabled, so that content is freely accessible by everyone visiting the site, and also indexed and discoverable by search engines like Google.

Again: those sites are purely chosen by googling for site:atlassian.net and clicking through sites until finding some that have a decent amount of spaces and pages.

Migration machines used for test migrations

Test migrations were performed on the following migration machines:

“Mig VM”

An Azure VM with the following characteristics:

Operating System: Windows 10
Size: B2s
vCPUs: 2
RAM: 4 GB
Max IOPS: 1280
Cost per month: €32.80
Download speed: 1200 Mbps
Upload speed: 1800 Mbps

“Mig Laptop”

A laptop:

Operating System: Debian Linux
vCPUs: 6
RAM: 40 GB
Download speed: 93 Mbps
Upload speed: 38 Mbps

Download and upload speed were determined using one of the many free internet sites that measure those values.

Confluence Cloud Site: University of Texas at Austin

The UT is providing us with a site that has over 300 spaces, many small, but two bigger ones.

Migration Metrics for Space 1

Migration Machine: Mig VM
Migrated number of pages: 801
Migration duration: 92 minutes

Here are the metrics (as logged into the common log files) 1 hour into the migration, based on data of this hour, covering 410 pages:

Median time (IQR)    : 4 s/item
Median time          : 4 s/item
75th percentile (IQR): 7 s/item
75th percentile      : 9 s/item
Mean time (IQR)      : 5 s/item
Mean time            : 8 s/item
Avg links transformed: 2/item
Have files           : 22% of items
File count avg       : 2.3 per item that has files
File size avg        : 1649 KB/file
Download speed       : ~485 KB/s
Upload speed         : ~373 KB/s
File size sum down   : 1299.6 MB
File size sum up     : 1293.5 MB

Top activities were:

(SharePoint, Page, Content)                                 : 28,52 min
(SharePoint, WikiTraccs, Prerequisites)                     : 14,98 min
(Confluence, Page, Link_Soft)                               : 14,69 min

There are only a few attachments in this space, only every fifth page has about 2 attachments. This is probably the reason why file upload and download are not in the top activities.

Most present are activities that require a lot of calls to APIs, like checking prerequisites, prepping the page, setting its content, and setting metadata for page and attachment folder.

In the end, looking at the overall migration duration and number of pages, the average migration duration was 7 seconds per page.
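
The arithmetic behind this average, and behind the "hours per 1000 pages" figures in the summary, is simple enough to reproduce for your own test migrations. A minimal sketch in Python (the function names are hypothetical); it assumes that migration time scales linearly with the number of pages, which only holds roughly when the remaining content is similar to the measured sample.

```python
def seconds_per_page(duration_minutes: float, pages: int) -> float:
    """Average migration time per page for a finished (test) migration."""
    if pages <= 0:
        raise ValueError("pages must be positive")
    return duration_minutes * 60 / pages


def hours_for_pages(sec_per_page: float, pages: int = 1000) -> float:
    """Linear projection of the total duration for a given number of pages."""
    return sec_per_page * pages / 3600


# Space 1 from above: 801 pages in 92 minutes.
avg = seconds_per_page(92, 801)
print(f"{avg:.1f} s/page")
print(f"{hours_for_pages(round(avg)):.1f} h per 1000 pages")
```

Running a test migration of a few hundred representative pages and feeding the numbers into such a projection gives a first rough estimate for the whole space.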

Migration Metrics for Space 2

Migration Machine: Mig VM
Migrated number of pages: 1233
Migration duration: 12h 10m

Here are the metrics (as logged into the common log files) 1 hour into the migration, based on data of this hour, covering 140 pages:

Median time (IQR)    : 11 s/item
Median time          : 13 s/item
75th percentile (IQR): 27 s/item
75th percentile      : 31 s/item
Mean time (IQR)      : 17 s/item
Mean time            : 25 s/item
Avg links transformed: 15/item
Have files           : 56% of items
File count avg       : 13.6 per item that has files
File size avg        : 1235 KB/file
Download speed       : ~297 KB/s
Upload speed         : ~218 KB/s
File size sum down   : 425.6 MB
File size sum up     : 427.5 MB

Top activities were:

(SharePoint, File, Content)                                 : 17.19 min
(SharePoint, Page, Content)                                 : 10.21 min
(Confluence, File, Content)                                 : 9.86 min

Compared with the first space, pages in this space have a lot of attachments. Half of the pages have an average of 13 attachments, each about 1.2 MB in size. That’s why the download and upload activities take more time in this space. All page metrics that measure time are about triple what they were for the first space.

Remember, those values are measured over a period of 60 minutes. Let’s look at how the values changed after 4 hours of migration, covering 89 page-like contents:

Median time (IQR)    : 29 s/item
Median time          : 29 s/item
75th percentile (IQR): 47 s/item
75th percentile      : 50 s/item
Mean time (IQR)      : 35 s/item
Mean time            : 40 s/item
Avg links transformed: 3/item
Have files           : 91% of items
File count avg       : 19.6 per item that has files
File size avg        : 499 KB/file
Download speed       : ~495 KB/s
Upload speed         : ~357 KB/s
File size sum down   : 919.3 MB
File size sum up     : 957.0 MB

Top activities were:

(SharePoint, File, Content)                                 : 24.83 min
(Confluence, File, Content)                                 : 14.17 min
(SharePoint, Page, Content)                                 : 7.61 min

All times went up. The number of pages having attachments went from 56% to 91% - those file downloads and uploads take time.

In the end, looking at the overall migration duration and number of pages, the average migration duration was 35 seconds per page.

Confluence Cloud Site: Geocaching.com Wiki

This site had one space.

Migration Metrics for Space

Migration Machine: Mig Laptop
Migrated number of pages: 262 (whole site)
Migration duration: 53 minutes

Metrics for all 262 pages:

Median time (IQR)    : 5 s/item
Median time          : 5 s/item
75th percentile (IQR): 12 s/item
75th percentile      : 14 s/item
Mean time (IQR)      : 8 s/item
Mean time            : 12 s/item
Avg links transformed: 4/item
Have files           : 35% of items
File count avg       : 2.5 per item that has files
File size avg        : 382 KB/file
Download speed       : ~173 KB/s
Upload speed         : ~96 KB/s
File size sum down   : 234.7 MB
File size sum up     : 234.7 MB

Top activities were:

(SharePoint, Page, Content)                                 : 22.15 min
(Confluence, Page, Link_Soft)                               : 12.91 min
(Confluence, Page, Transformation_Macro_Other)              : 12.12 min

In the end, looking at the overall migration duration and number of pages, the average migration duration was 12 seconds per page.
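The per-page average quoted for each site is simply the overall migration duration divided by the number of migrated pages. A minimal sketch of that arithmetic, using the Geocaching.com values from above; the function name is made up for illustration, and how the post rounds the result is not specified:

```python
def average_seconds_per_page(hours: int, minutes: int, pages: int) -> float:
    """Overall migration duration divided by the number of migrated pages."""
    total_seconds = (hours * 60 + minutes) * 60
    return total_seconds / pages


# Values from the Geocaching.com Wiki sample: 53 minutes, 262 pages.
average = average_seconds_per_page(hours=0, minutes=53, pages=262)
print(f"{average:.1f} s/page")  # prints 12.1 s/page
```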

Confluence Cloud Site: Xilinx Wiki

This site had one space.

Migration Metrics for Space

Migration Machine: Mig VM
Migrated number of pages: 1284 (whole site)
Migration duration: 7h 22m

Here are the metrics (as logged into the common log files) 1 hour into the migration, based on data of this hour, covering 221 pages:

Median time (IQR)    : 10 s/item
Median time          : 10 s/item
75th percentile (IQR): 18 s/item
75th percentile      : 21 s/item
Mean time (IQR)      : 14 s/item
Mean time            : 15 s/item
Avg links transformed: 7/item
Have files           : 61% of items
File count avg       : 6.7 per item that has files
File size avg        : 1419 KB/file
Download speed       : ~724 KB/s
Upload speed         : ~434 KB/s
File size sum down   : 1264.3 MB
File size sum up     : 1264.3 MB

Top activities were:

(SharePoint, File, Content)                                 : 22.25 min
(Confluence, Page, Transformation_Macro_Other)              : 20.55 min
(SharePoint, Page, Content)                                 : 15.97 min

In the end, looking at the overall migration duration and number of pages, the average migration duration was 20 seconds per page.

Confluence Cloud Site: Duke University

This site had 120 spaces.

Migration Metrics for whole Site

Migration Machine: Mig Laptop
Migrated number of pages: 3117 (whole site)
Migration duration: 12h 46m

Here are the metrics (as logged into the common log files) 1 hour into the migration, based on data of this hour, covering 330 pages:

Median time (IQR)    : 5 s/item
Median time          : 5 s/item
75th percentile (IQR): 11 s/item
75th percentile      : 13 s/item
Mean time (IQR)      : 8 s/item
Mean time            : 10 s/item
Avg links transformed: 2/item
Have files           : 43% of items
File count avg       : 2.6 per item that has files
File size avg        : 421 KB/file
Download speed       : ~239 KB/s
Upload speed         : ~104 KB/s
File size sum down   : 463.5 MB
File size sum up     : 463.5 MB

Top activities were:

(SharePoint, Page, Content)                                 : 23.05 min
(SharePoint, File, Content)                                 : 15.38 min
(SharePoint, WikiTraccs, Prerequisites)                     : 11.04 min

Let’s look at how the values changed after 4 hours of migration, covering 110 page-like contents:

Median time (IQR)    : 14 s/item
Median time          : 14 s/item
75th percentile (IQR): 59 s/item
75th percentile      : 59 s/item
Mean time (IQR)      : 32 s/item
Mean time            : 32 s/item
Avg links transformed: 0/item
Have files           : 65% of items
File count avg       : 17.8 per item that has files
File size avg        : 399 KB/file
Download speed       : ~444 KB/s
Upload speed         : ~212 KB/s
File size sum down   : 567.1 MB
File size sum up     : 587.2 MB

Top activities were:

(SharePoint, File, Content)                                 : 35.10 min
(Confluence, File, Content)                                 : 15.23 min
(SharePoint, Page, Content)                                 : 10.87 min

In the end, looking at the overall migration duration and number of pages, the average migration duration was 15 seconds per page.

Confluence Cloud Site: Harvard University

This site had 46 spaces.

Migration Metrics for whole Site

Migration Machine: Mig VM
Migrated number of pages: 6647 (whole site)
Migration duration: 20h 23m

Here are the metrics (as logged into the common log files) 1 hour into the migration, based on data of this hour, covering 211 pages:

Median time (IQR)    : 7 s/item
Median time          : 9 s/item
75th percentile (IQR): 15 s/item
75th percentile      : 17 s/item
Mean time (IQR)      : 10 s/item
Mean time            : 17 s/item
Avg links transformed: 6/item
Have files           : 50% of items
File count avg       : 4.2 per item that has files
File size avg        : 822 KB/file
Download speed       : ~371 KB/s
Upload speed         : ~199 KB/s
File size sum down   : 1200.9 MB
File size sum up     : 1200.9 MB

Top activities were:

(SharePoint, Page, Content)                                 : 14.74 min
(SharePoint, File, Content)                                 : 13.92 min
(Confluence, Page, Link_Soft)                               : 10.44 min

Let’s look at how the values changed after 4 hours of migration, covering 325 page-like contents:

Median time (IQR)    : 4 s/item
Median time          : 4 s/item
75th percentile (IQR): 9 s/item
75th percentile      : 11 s/item
Mean time (IQR)      : 7 s/item
Mean time            : 11 s/item
Avg links transformed: 2/item
Have files           : 35% of items
File count avg       : 2.6 per item that has files
File size avg        : 558 KB/file
Download speed       : ~165 KB/s
Upload speed         : ~105 KB/s
File size sum down   : 340.8 MB
File size sum up     : 340.7 MB

Top activities were:

(SharePoint, Page, Content)                                 : 22.18 min
(SharePoint, File, Content)                                 : 12.34 min
(SharePoint, WikiTraccs, Prerequisites)                     : 11.77 min

In the end, looking at the overall migration duration and number of pages, the average migration duration was 11 seconds per page.

Confluence Cloud Site: Hyperledger Foundation

This site had 74 spaces.

Migration Metrics for whole Site

Migration Machine: Mig VM
Migrated number of pages: 9238 (whole site)
Migration duration: 32h 40m

Here are the metrics (as logged into the common log files) 1 hour into the migration, based on data of this hour, covering 390 pages:

Median time (IQR)    : 7 s/item
Median time          : 7 s/item
75th percentile (IQR): 9 s/item
75th percentile      : 9 s/item
Mean time (IQR)      : 7 s/item
Mean time            : 9 s/item
Items per hour       : 400-514 (based on IQR Median and IQR 75th percentile)
Avg links transformed: 3/item
Have files           : 62% of items
File count avg       : 1.1 per item that has files
File size avg        : 294 KB/file
Download speed       : ~679 KB/s
Upload speed         : ~94 KB/s
File size sum down   : 1618.1 MB
File size sum up     : 1619.3 MB

Top activities were:

(Confluence, Page, Link_Soft)                               : 53.22 min
(SharePoint, Page, Content)                                 : 29.33 min
(Confluence, Page, Transformation_UserMention)              : 17.99 min
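The Items per hour range shown above appears to be the per-item times converted into throughput: 3600 seconds divided by the seconds per item, rounded down. The sketch below reproduces the logged 400-514 from the 9 s and 7 s values; that WikiTraccs calculates it exactly this way is an assumption.

```python
def items_per_hour(seconds_per_item: float) -> int:
    """Convert a per-item duration into a whole number of items per hour."""
    return int(3600 // seconds_per_item)


# 75th percentile (IQR) of 9 s/item and 7 s/item from the metrics above.
slow, fast = items_per_hour(9), items_per_hour(7)
print(f"{slow}-{fast}")  # prints 400-514
```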

Let’s look at how the values changed after 4 hours of migration, covering 239 page-like contents:

Median time (IQR)    : 14 s/item
Median time          : 14 s/item
75th percentile (IQR): 18 s/item
75th percentile      : 18 s/item
Mean time (IQR)      : 14 s/item
Mean time            : 15 s/item
Items per hour       : 200-257 (based on IQR Median and IQR 75th percentile)
Avg links transformed: 4/item
Have files           : 53% of items
File count avg       : 1.6 per item that has files
File size avg        : 370 KB/file
Download speed       : ~102 KB/s
Upload speed         : ~81 KB/s
File size sum down   : 93.1 MB
File size sum up     : 93.1 MB

Top activities were:

(SharePoint, Page, Content)                                 : 24.94 min
(SharePoint, File, Content)                                 : 13.86 min
(Confluence, Page, Link_Soft)                               : 13.01 min

In the end, looking at the overall migration duration and number of pages, the average migration duration was 13 seconds per page.

Confluence Cloud Site: logica.atlassian.net

This site had 19 spaces.

Migration Metrics for whole Site

Migration Machine: Mig VM
Migrated number of pages: 1494 (whole site)
Migration duration: 5h 9m

Here are the metrics (as logged into the common log files) 1 hour into the migration, based on data of this hour, covering 489 pages:

Median time (IQR)    : 4 s/item
Median time          : 4 s/item
75th percentile (IQR): 4 s/item
75th percentile      : 6 s/item
Mean time (IQR)      : 4 s/item
Mean time            : 7 s/item
Items per hour       : 900-900 (based on IQR Median and IQR 75th percentile)
Avg links transformed: 1/item
Have files           : 13% of items
File count avg       : 1.5 per item that has files
File size avg        : 372 KB/file
Download speed       : ~127 KB/s
Upload speed         : ~99 KB/s
File size sum down   : 294.0 MB
File size sum up     : 294.2 MB

Top activities were:

(SharePoint, Page, Content)                                 : 30.43 min
(Confluence, Page, Link_Soft)                               : 17.85 min
(SharePoint, WikiTraccs, Prerequisites)                     : 17.11 min

Let’s look at how the values changed after 4 hours of migration, covering 255 page-like contents:

Median time (IQR)    : 4 s/item
Median time          : 4 s/item
75th percentile (IQR): 7 s/item
75th percentile      : 11 s/item
Mean time (IQR)      : 6 s/item
Mean time            : 14 s/item
Items per hour       : 514-600 (based on IQR Median and IQR 75th percentile)
Avg links transformed: 1/item
Have files           : 30% of items
File count avg       : 2.9 per item that has files
File size avg        : 2069 KB/file
Download speed       : ~705 KB/s
Upload speed         : ~451 KB/s
File size sum down   : 13192.5 MB
File size sum up     : 13828.6 MB

Top activities were:

(SharePoint, Page, Content)                                 : 14.34 min
(SharePoint, WikiTraccs, Prerequisites)                     : 8.29 min
(SharePoint, File, Content)                                 : 7.54 min

In the end, looking at the overall migration duration and number of pages, the average migration duration was 12 seconds per page.

Confluence Cloud Site: vzvz.atlassian.net

This site had 26 spaces.

Migration Metrics for whole Site

Migration Machine: Mig VM
Migrated number of pages: 8541 (whole site)
Migration duration: 31h 45m

Here are the metrics (as logged into the common log files) 1 hour into the migration, based on data of this hour, covering 369 pages:

Median time (IQR)    : 6 s/item
Median time          : 6 s/item
75th percentile (IQR): 13 s/item
75th percentile      : 13 s/item
Mean time (IQR)      : 8 s/item
Mean time            : 9 s/item
Items per hour       : 276-450 (based on IQR Median and IQR 75th percentile)
Avg links transformed: 3/item
Have files           : 47% of items
File count avg       : 3.4 per item that has files
File size avg        : 271 KB/file
Download speed       : ~219 KB/s
Upload speed         : ~103 KB/s
File size sum down   : 173.9 MB
File size sum up     : 174.7 MB

Top activities were:

(SharePoint, Page, Content)                                 : 27.74 min
(Confluence, Page, Link_Soft)                               : 22.21 min
(SharePoint, File, Content)                                 : 20.38 min

Let’s look at how the values changed after 4 hours of migration, covering 255 page-like contents:

Median time (IQR)    : 10 s/item
Median time          : 10 s/item
75th percentile (IQR): 13 s/item
75th percentile      : 15 s/item
Mean time (IQR)      : 11 s/item
Mean time            : 13 s/item
Items per hour       : 276-327 (based on IQR Median and IQR 75th percentile)
Avg links transformed: 1/item
Have files           : 26% of items
File count avg       : 1.9 per item that has files
File size avg        : 533 KB/file
Download speed       : ~134 KB/s
Upload speed         : ~84 KB/s
File size sum down   : 68.5 MB
File size sum up     : 67.4 MB

Top activities were:

(SharePoint, Page, Content)                                 : 30.62 min
(SharePoint, WikiTraccs, Prerequisites)                     : 12.94 min
(Confluence, Page, Link_Soft)                               : 11.94 min

In the end, looking at the overall migration duration and number of pages, the average migration duration was 13 seconds per page.

Wrap

In this blog post we looked at some metrics from real test migrations to get a feeling for how long a migration can take.

Live metrics can only predict the future if we assume that pages are comparable in their characteristics (mainly number of attachments), which might not be the case. Thus, the metrics will change over the course of the migration as metrics are always calculated based on data from the last 60 minutes.

So far it seems that the metrics Mean Time (IQR) and 75th percentile (IQR) could be predictors for the overall migration time within a space.
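As a rough sketch of how such a prediction could be made, the two metrics can be multiplied by the number of pages to get a projected range. The function names are hypothetical, and the result is only as good as the assumption that the remaining pages resemble those of the last 60 minutes. The example uses the Xilinx Wiki sample (1284 pages, 14 s and 18 s per item after one hour), whose actual migration took 7h 22m.

```python
def projected_hours(pages: int, seconds_per_item: float) -> float:
    """Projected duration in hours if every page took seconds_per_item."""
    return pages * seconds_per_item / 3600


def projected_range(pages: int, mean_iqr_s: float, p75_iqr_s: float) -> tuple[float, float]:
    """Lower and upper estimate based on Mean time (IQR) and 75th percentile (IQR)."""
    return projected_hours(pages, mean_iqr_s), projected_hours(pages, p75_iqr_s)


low, high = projected_range(pages=1284, mean_iqr_s=14, p75_iqr_s=18)
print(f"{low:.1f} h to {high:.1f} h")  # prints 5.0 h to 6.4 h
```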

Over time, I’ll add more samples to this blog post and might also update the values, as the metrics calculation evolves.

Confluence Quirks

This post is a collection of quirky stuff that Confluence throws at WikiTraccs.

By Heinrich Ulbricht | Saturday, November 30, 2024

When I speak about page migrations failing, I mean the things on this page. All of them tripped up a page migration at some point; they are now known and worked around by WikiTraccs (where possible).

This list is more of a “fun” documentation of things I never thought could happen.

page with empty title; how is that even possible?
duplicate attachments:
attachment name that looks like a path; can’t be downloaded:
attachments that differ only by Unicode Normalization Form; names look the same, but are differently encoded; depending on the application that handles those files, they are treated as duplicate - or not
attachments that are listed in the attachment list, but are missing when trying to download them
attachments that are listed in the attachment list, but give an error when trying to download them:
- note that in the above case the image preview for this image was missing as well, but clicking this “broken image” placeholder did show the image:
attachment names with special characters that are valid on Linux, but not on Windows
attachment file sizes reported by Confluence are wrong
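Two of these quirks can be detected with a few lines of code. The following is a minimal sketch, not how WikiTraccs handles them: it groups attachment names that become identical after Unicode normalization (NFC is chosen here as an assumption) and flags names containing characters that Windows does not allow in file names. All function names are made up for illustration.

```python
import unicodedata

# Characters that Windows does not allow in file names.
WINDOWS_INVALID_CHARS = set('<>:"/\\|?*')


def find_normalization_duplicates(names: list[str]) -> list[list[str]]:
    """Group names that are identical once normalized to NFC."""
    groups: dict[str, list[str]] = {}
    for name in names:
        groups.setdefault(unicodedata.normalize("NFC", name), []).append(name)
    return [group for group in groups.values() if len(group) > 1]


def has_windows_invalid_chars(name: str) -> bool:
    """True if the name contains a reserved or control character."""
    return any(ch in WINDOWS_INVALID_CHARS or ord(ch) < 32 for ch in name)


composed = "Caf\u00e9.png"     # "é" as a single code point
decomposed = "Cafe\u0301.png"  # "e" followed by a combining accent
print(composed == decomposed)  # prints False
print(len(find_normalization_duplicates([composed, decomposed, "other.png"])))  # prints 1
print(has_windows_invalid_chars("report: draft?.pdf"))  # prints True
```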

This list will be extended as more quirks are uncovered.

Good Practices for your Confluence to SharePoint Migration

This post is about good practices that I picked up from clients.

By Heinrich Ulbricht | Friday, November 22, 2024

Often I get asked about best practices around Confluence to SharePoint migrations and WikiTraccs.

Please note that I am not directly involved in migration projects. If there is a perfect project where the client doesn’t need my help because WikiTraccs does its job, I’ll never hear from them.

What I hear about are issues. Issues with tooling, but also non-technical issues.

Recommendations

Here’s a collection of things I recommend.

Prepare users for SharePoint

I often hear phrases like “Leadership has decided” to proceed with moving content to SharePoint. However, some users may not have had any prior experience with SharePoint. Familiarize them with SharePoint or, at the very least, help them anticipate that SharePoint will be different from Confluence.

Test the migration

Perform a test migration using WikiTraccs. Evaluate the results. Purchase only if the results match your expectations. Depending on how well your stakeholders know SharePoint, expectations vary wildly.

Expect layout changes. SharePoint is no Confluence. SharePoint doesn’t offer the same formatting and layout features as Confluence. You’ll see that for example with images, where SharePoint is not capable of showing them side by side 😔.

Prepare for macros being gone

Most of the Confluence macros don’t exist in SharePoint. Identify key use cases that exist in Confluence and rely on specific macros or metadata. Manually try to rebuild those cases in SharePoint. WikiTraccs might help you with that, but you will also have to think about using extensions like PnP Search web parts. You’ll probably also use services outside SharePoint to re-implement those use cases, like the Power Platform, Loop, Teams, or others.

Prepare your service desk for hypercare questions

After the migration users will ask questions. Provide them with a channel to get those questions answered.

Example questions:

Why doesn’t the table of contents update in SharePoint?
- Answer: because the list of links is static and the table of contents macro is gone.
Why are child pages not showing up on the page?
- Answer: because the children macro is gone.

Don’t do a permission migration

Confluence and SharePoint behave differently when it comes to permissions. Confluence can have page hierarchies. In Confluence, you can restrict pages to a more narrow circle of users with each hierarchy level. In SharePoint, there is just one level. Multiple levels of Confluence page restrictions need to be combined to just one SharePoint page.

For a more complex hierarchy of restrictions in Confluence it is nearly impossible to map that 1:1 to SharePoint.
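To illustrate what combining restriction levels means, here is a simplified sketch. It assumes view restrictions are plain sets of user names and that a user must pass every restricted ancestor to see a page, so the combined restriction is the intersection of all levels. Groups, edit restrictions, and the way WikiTraccs actually maps permissions are left out.

```python
from typing import Optional


def effective_viewers(restrictions_root_to_page: list[Optional[set[str]]]) -> Optional[set[str]]:
    """Combine the view restrictions of a page and its ancestors.

    Each entry is the set of allowed users for one hierarchy level,
    or None if that level has no restriction. Returns None if no
    level is restricted at all.
    """
    effective: Optional[set[str]] = None
    for restriction in restrictions_root_to_page:
        if restriction is None:
            continue  # unrestricted level, nothing to narrow down
        effective = set(restriction) if effective is None else effective & restriction
    return effective


# Hypothetical hierarchy: unrestricted root, restricted parent, restricted page.
levels = [None, {"alice", "bob", "carol"}, {"bob", "carol", "dave"}]
print(sorted(effective_viewers(levels)))  # prints ['bob', 'carol']
```

Even in this reduced form the single resulting set no longer tells you which level excluded whom, which hints at why a 1:1 mapping gets difficult.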

That being said, WikiTraccs supports migrating permissions, to the extent possible. But a permission migration might give you a headache.

Archive older content to one site

Identify old Confluence content to archive and migrate this content to one SharePoint archival site.

Skip personal Confluence spaces

Prepare users to manually move content they need from their personal spaces.

Don’t change networking infrastructure during the migration

One client upgraded Confluence to TLS 1.3 during the migration. This caused issues in combination with Windows 10. Everything can be solved, but this can cause interruptions that should be prevented during a migration.

Plan all target sites before the migration

You should know which Confluence content will be migrated to which target SharePoint site. Enter target site URLs in the Space Inventory list. This is required for the link transformation to be successful throughout the migration as WikiTraccs will look up those sites when transforming cross-space links.

Note that WikiTraccs doesn’t create those sites, you create them, or choose from existing ones.

Check the WikiTraccs settings

The blue WikiTraccs window has a menu bar with a Settings option. Click that and click through the tabs in the Settings window to familiarize yourself with the settings. Some choices have to be made, such as whether blog posts should be migrated.

Use migration waves for larger space counts

You can define migration waves. If you have hundreds of spaces, it might be beneficial to not choose all at once for migration, but to give them wave numbers and migrate each wave separately, or even (some) in parallel.

Bring the page tree to SharePoint Online

The number one missing productivity feature in SharePoint with regard to pages is the page tree. WikiPakk finally brings the page tree to SharePoint.

Repeat migration runs

During longer migration runs there can be connection issues that prevent single pages from migrating. Re-run the migration to get those pages over as well.

With WikiTraccs, you can re-run the migration multiple times. Before each run, WikiTraccs checks which pages are missing in SharePoint and only migrates those that are missing. This proved beneficial especially in environments with an unstable internet connection.
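Conceptually, this check is a set difference between the pages selected for migration and the pages already present in the target. The sketch below only illustrates the idea; the function name and the ID lists are hypothetical and do not reflect WikiTraccs internals.

```python
def pages_still_missing(selected_page_ids: list[str], migrated_page_ids: list[str]) -> list[str]:
    """Return the selected pages that are not yet present in SharePoint, in order."""
    already_there = set(migrated_page_ids)
    return [page_id for page_id in selected_page_ids if page_id not in already_there]


# Hypothetical state after a first run that was interrupted by connection issues.
selected = ["1001", "1002", "1003", "1004"]
migrated = ["1001", "1003"]
print(pages_still_missing(selected, migrated))  # prints ['1002', '1004']
```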

Only when a migration run doesn’t seem to migrate any more pages should you go ahead and look at the progress log files to check the final result.

Check the playbook

The Migration Playbook provides guidance about configuring WikiTraccs and evaluating migration results.

Additional topics

Some additional topics that often come up.

How many SharePoint sites do we need?

I cannot answer that as this is up to your information architecture.

What I often see is a 1:1 relationship between spaces and sites. So, all pages of a Confluence space are migrated to a target SharePoint site.

How long will the migration take?

This is hard to answer as it depends on a number of factors. The FAQ has some thoughts about that: How much time does it take to migrate?

Confluence Cloud Specialties

This post explains how certain content can only be migrated using workarounds and also lists the prerequisites for those workarounds to work.

By Heinrich Ulbricht | Monday, November 18, 2024

Note

This post is only relevant for Confluence Cloud.

To migrate content from Confluence Cloud to SharePoint Online, WikiTraccs needs to access content in Confluence Cloud to download it. Content means page content, attachments, metadata, macro details, new content types like whiteboards, and more.

There are two ways to get content from Confluence Cloud: using a programming interface that is officially supported by Atlassian, and using internal interfaces.

Ideally, only officially supported interfaces are used; for Confluence Cloud, that would be the REST API.

Since this post exists, you might already guess where this is leading. Not all content is accessible through official means.

Content that can only be migrated via workarounds

There are Confluence elements which cannot be accessed using officially documented interfaces (such as the Confluence Cloud REST API).

WikiTraccs tries to work around those limitations.

Here’s the list of (currently known) elements that need a workaround to be migrated:

Jira issue lists
Whiteboards

Let’s look at each of those.

Jira issue lists

WikiTraccs takes static snapshots of Jira issue lists so that the migrated SharePoint page contains a static issue table as well.

In Confluence Cloud the endpoint to do that conversion is not officially accessible (WikiTraccs issue #123).

WikiTraccs works around this limitation by using the same endpoint that Confluence uses when you work in the browser.

Prerequisite: This workaround is only available to WikiTraccs when using Interactive Login. It is not supported when using API tokens or Anonymous access.

Whiteboards

Read this blog post to learn how WikiTraccs works around the missing Confluence Cloud export functionality and the ramifications: Migrating Confluence Cloud Whiteboards

Closing notes

WikiTraccs uses workarounds to migrate content that otherwise is not accessible to apps via official Atlassian interfaces.

Those workarounds are based on internal interfaces and they might break at any time since Atlassian is not required to announce changes. So, it might happen that a workaround stops working temporarily, requires a WikiTraccs update, or stops working altogether. There is nothing that can be done about that, apart from lobbying for Atlassian to provide proper interfaces for exporting all content.

WikiTraccs uses workarounds exclusively for content that otherwise would not be accessible at all and would thus be missing in migrated pages.

WikiTraccs switches to Confluence Cloud v2 REST API

This blog post outlines a bigger change in how WikiTraccs works with Confluence Cloud.

By Heinrich Ulbricht | Sunday, November 17, 2024

Note

The information in this post is relevant for Confluence Cloud only. There are no changes for Confluence on-premises.

Background

Confluence Cloud so far provided programming interfaces (APIs) that allowed applications like WikiTraccs to treat it no differently than Confluence Server or Confluence Data Center.

Those days are over.

Atlassian is removing critical parts of the v1 REST API (that provided access compatible with the on-prem Confluence versions). Instead, applications are now forced to use the v2 REST API, starting December 2, 2024.

Note

Atlassian extended the deadline to March 31, 2025, shortly after writing this post.

Here is the announcement by Atlassian:

What does this mean for users of WikiTraccs?

Ideally there should be no noticeable difference. But the v2 API, in my view, is a step back in terms of developer convenience. More work has to be done to retrieve the same data as before.

From now on it takes both more time and more transmitted data to get the same work done as before.

WikiTraccs Update Required by December 2, 2024

Starting with version v1.21.3 WikiTraccs uses the new v2 REST API instead of deprecated v1 endpoints when talking to Confluence Cloud.

Starting December 2, 2024, using this or a newer WikiTraccs version is mandatory for Confluence Cloud to SharePoint migrations (unless Atlassian extends the deadline).

Until December 2, 2024, switching back to v1 API is possible. You can use the Force use of v1 REST API setting to switch back to v1:

If there are unforeseen issues, this setting can be used to check whether they are caused by the switch to v2. It shouldn’t be necessary.

If you find issues, use any of the support channels to get in touch.

Migrated SharePoint Page Names Explained

This post goes into the naming scheme WikiTraccs applies to migrated pages.

By Heinrich Ulbricht | Saturday, November 16, 2024

SharePoint pages migrated by WikiTraccs all follow the same naming scheme. We’ll take a look at how those names look and why this scheme is applied.

How WikiTraccs names migrated pages

In contrast to Confluence, SharePoint pages are files and those files have a file name and a title. When migrating a Confluence page to SharePoint, WikiTraccs has to set both the file name and the title.

We’ll focus on the file name (also referred to as page name or just name).

Note

The naming scheme is only applied to the file name of SharePoint pages, NOT the page title. The page title (that is visible to the end user in most places) can have arbitrary values and is set to the original Confluence page title by WikiTraccs.

The SharePoint page file name will be derived from

the Confluence space key (soon: space alias)
the Confluence page title, and
the Confluence page ID.

For a space key of HR, a title of Onboarding, and a page ID of 123456789, the SharePoint page name will be HR-Onboarding-123456789.aspx.
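Expressed as code, the scheme from this example is a simple concatenation. This sketch only mirrors the example above; any sanitizing or shortening of titles that WikiTraccs might apply is not covered, and the function name is made up.

```python
def sharepoint_page_name(space_key: str, page_title: str, page_id: int) -> str:
    """Build the page file name from space key, page title, and page ID."""
    return f"{space_key}-{page_title}-{page_id}.aspx"


print(sharepoint_page_name("HR", "Onboarding", 123456789))  # prints HR-Onboarding-123456789.aspx
```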

This scheme WikiTraccs applies is fixed and cannot be changed.

Why WikiTraccs names pages like it does

The naming scheme presented in the previous section is applied so that WikiTraccs can create SharePoint links between pages, without having to actually migrate all pages that link to each other. It can infer page names from page metadata.

An example of two pages and a link

Let’s look at an example of two pages (space key, page title, page ID):

HR, Onboarding, 123456789
IT, Welcome Package, 987654321

The HR Onboarding page links to the IT Welcome Package page.

Now you task WikiTraccs to migrate all Confluence pages of space HR to SharePoint.

When migrating the HR Onboarding page, WikiTraccs needs to transform the (Confluence) link to the IT Welcome Package page to a SharePoint link.

SharePoint links in general

In general, SharePoint page links look like https://yourcompany.sharepoint.com/sites/<SITENAME>/SitePages/<PAGENAME>.

For each link to a Confluence page, attachment, or space, WikiTraccs needs to create the corresponding SharePoint link.

So, WikiTraccs has to figure out the SITENAME and PAGENAME parts of each SharePoint link it creates.

WikiTraccs creates the SharePoint link

For the PAGENAME part of the SharePoint link, WikiTraccs applies the naming scheme that is the focus of this post. It knows that the IT Welcome Package page, once migrated, will get the page name IT-Welcome Package-987654321.aspx.

The SITENAME part of the link is looked up in the Space Inventory.

Having both PAGENAME and SITENAME, the final SharePoint link might look like this:

https://yourcompany.sharepoint.com/sites/ITS/SitePages/IT-Welcome Package-987654321.aspx
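The two lookups can be sketched as follows. The dictionary stands in for the Space Inventory and maps a space key to its target site URL; both the dictionary and the function name are hypothetical. Percent-encoding the file name is an addition of this sketch, so the space in the page name shows up as %20.

```python
from urllib.parse import quote

# Stand-in for the Space Inventory: space key -> target SharePoint site URL.
SPACE_INVENTORY = {"IT": "https://yourcompany.sharepoint.com/sites/ITS"}


def sharepoint_page_link(space_key: str, page_title: str, page_id: int,
                         inventory: dict[str, str]) -> str:
    """Derive a SharePoint link from page metadata alone, without migrating the page."""
    site_url = inventory[space_key].rstrip("/")                # SITENAME part
    page_name = f"{space_key}-{page_title}-{page_id}.aspx"     # PAGENAME part
    return f"{site_url}/SitePages/{quote(page_name)}"


print(sharepoint_page_link("IT", "Welcome Package", 987654321, SPACE_INVENTORY))
# prints https://yourcompany.sharepoint.com/sites/ITS/SitePages/IT-Welcome%20Package-987654321.aspx
```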

How to name pages differently?

Currently there is no way to change the SharePoint page file names that WikiTraccs creates during the migration as it relies on those names to create links between pages.

Can I rename pages later?

After finishing all migration tasks, you might choose to rename pages via PowerShell. Since I did not conduct large-scale tests in this direction, I cannot say how well SharePoint handles renamed pages and especially linking to those renamed pages from other pages and web parts.

One sample of a working rename operation can be found in the WikiTraccs discussions.

Wrap

In this post we looked at how SharePoint pages are being named by WikiTraccs and how the naming scheme looks. We also looked into how the naming helps to create links between pages.

How to migrate only some pages?

This post shows how to migrate some selected pages from Confluence to SharePoint, instead of a whole Confluence space.

By Heinrich Ulbricht | Wednesday, November 06, 2024

Note

When talking about pages, this also means blog posts and the new cloud content types like whiteboards. A more common term would be content, but that does sound a bit generic and would also cover attachments. So let’s keep it at pages where it fits.

With WikiTraccs, selecting pages for migration is always done in the Confluence Space Inventory list (short: space inventory).

The space inventory contains source selectors that tell WikiTraccs which pages to migrate.

In this blog post we’ll look at one specific source selector type that allows selecting individual pages for migration: the Content ID selector.

Note

Learn more about all available content selectors here: Confluence Space Inventory / Selectors.

How to tell WikiTraccs which pages to migrate, by content ID

Everything in Confluence - pages, blog posts, attachments, comments, you name it - has an ID: the content ID.

For now, let’s assume you already have a list of content IDs you want to migrate (we’ll look at how to get those IDs in the next section). You would mark them for migration by creating a new source selector in the space inventory, like this:

open the space inventory (note: this is a SharePoint list and can be used as such)
add a new item
enter the following data for the newly created item:

Field	Sample Value	Remarks
Title	notused	the title is ignored, but is a mandatory field; enter anything here
WT_In_SiteId	https://CHANGEME.atlassian.net/wiki	enter your Confluence base URL; look at the other items in the space inventory and copy it from there
WT_Setting_RequestTransformation	Yes	mark the selector for migration
WT_Setting_TargetSiteRootUrl	https://contoso.sharepoint.com/sites/target	enter the target SharePoint site here
WT_Setting_ContentSelectorValue	123456789;#page,987654321;#blogpost	this list of ID-type combinations tells WikiTraccs what to migrate; each value consists of the content ID (like 123456789) followed by the content type (page, blogpost, whiteboard, database, …), separated by ;# (like so: ID;#TYPE); multiple values are separated by comma (like so: ID;#TYPE,ID;#TYPE,ID;#TYPE)

the other fields are optional and can be left blank

When starting the next migration, WikiTraccs should process this selector and migrate the pages.
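For longer lists, the value for WT_Setting_ContentSelectorValue can be generated instead of typed by hand. A small sketch that follows the ID;#TYPE format described in the table above; the helper names are made up for illustration.

```python
def build_selector_value(items: list[tuple[str, str]]) -> str:
    """Join (content ID, content type) pairs into ID;#TYPE,ID;#TYPE,..."""
    return ",".join(f"{content_id};#{content_type}" for content_id, content_type in items)


def parse_selector_value(value: str) -> list[tuple[str, str]]:
    """Split a selector value back into (content ID, content type) pairs."""
    pairs = []
    for entry in value.split(","):
        content_id, content_type = entry.strip().split(";#")
        pairs.append((content_id, content_type))
    return pairs


value = build_selector_value([("123456789", "page"), ("987654321", "blogpost")])
print(value)  # prints 123456789;#page,987654321;#blogpost
```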

How to get content IDs and types?

Depending on your migration team’s level of access there are different ways to get content IDs and content types.

How to manually look up the content ID and type for a page

Any user that can view Confluence pages in the browser can do this.

Confluence Server and Data Center

In the browser, open the Confluence page you need the content ID for.

Open the Page Information for that page:

The browser will now show the page ID in its address bar; here it is 10387457:

The Page Information view and address bar will look similar for pages and blog posts. You can infer that you are looking at a blog post by looking at the breadcrumb, which contains a date:

Note: The Page Information view is also available in Confluence Cloud.

Confluence Cloud

In Confluence Cloud the content ID and type are always shown in the address bar of your browser when navigating pages.

With a page open, the address bar will look like this:

https://contoso.atlassian.net/wiki/spaces/SPACE/pages/123456789/Page+Title

The /pages/ part tells you that this is a page, and its ID is 123456789.

For a blog post this looks similar:

https://contoso.atlassian.net/wiki/spaces/SPACE/blog/2024/10/24/987654321/Blog+Title

The /blog/ part tells you that this is a blog post, and its ID is 987654321.

This works similarly for other content types like whiteboards.
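If you have a list of such Confluence Cloud addresses, the ID and type can be extracted with a small script. This is a sketch that only covers the two address formats shown above (pages and blog posts); the function name content_id_and_type is hypothetical, and other content types would need their own patterns.

```python
import re
from urllib.parse import urlparse

# Patterns follow the two address bar examples above; other content types are not covered.
_PAGE = re.compile(r"/spaces/[^/]+/pages/(\d+)(?:/|$)")
_BLOG = re.compile(r"/spaces/[^/]+/blog/\d{4}/\d{2}/\d{2}/(\d+)(?:/|$)")


def content_id_and_type(url):
    """Return (content ID, content type) for a Cloud page or blog post URL, else None."""
    path = urlparse(url).path
    match = _PAGE.search(path)
    if match:
        return match.group(1), "page"
    match = _BLOG.search(path)
    if match:
        return match.group(1), "blogpost"
    return None


print(content_id_and_type(
    "https://contoso.atlassian.net/wiki/spaces/SPACE/pages/123456789/Page+Title"
))  # ('123456789', 'page')
```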

How to use a space backup to look up content IDs and types

As a Confluence system administrator, you can create a space backup. You’ll see content IDs and types in this backup.

Each space backup is a zip file that contains an entities.xml file.

You’ll find information about all pages (of the backed-up space) in this xml file:

The same for blog posts:

How to use SQL queries to look up content IDs and types (on-premises only)

This option is the most flexible one, but can only be applied with administrative access to the database server backing Confluence.

Have a look at this recipe on how to use SQL to get a list of pages for a space: Getting a list of pages per space [on-premises].

Use cases for the Content ID selector

The Content ID selector has several benefits:

it allows selecting content to migrate based on criteria you define
it works whether or not the migration account is included in page restrictions; this differs from the CQL selector
it can work around the issue that some Confluence instances show with space selectors: Confluence might misreport space contents

The drawback of course is that you manually have to assemble the list of IDs. Using the space selector to migrate whole spaces is much more convenient.

Wrap

In this post we created a Content ID selector to migrate two selected pages. We also learned how to get the content IDs for pages. Depending on the permissions of the migration team, there are different ways to do that.

Confluence vs. SharePoint - Part 1: Overall structure

In this post we look at how content is structured in both Confluence and SharePoint.

By Heinrich Ulbricht | Wednesday, October 30, 2024

Note

When talking about Confluence pages, blog posts are also meant, because they are basically pages with another name. When looking at the cloud there is more than pages and blog posts but let’s keep things simple by looking at what is at the core of Confluence: pages.

Confluence

When looking at a Confluence page the basic structure looks like this:

A Confluence page

The following parts make up a Confluence page:

Page Content, including
- Versions
- Inline Comments
- Inline Tasks
- @-Mentions
- Macros
Attachments, including
- Versions
- Comments
Footer Comments
- like page content…
Restrictions
Metadata, including
- Title
- Creator, Author
- Creation Date, Modification Date
- Likes
- Labels

Each page lives in a Confluence space where each space can contain hundreds, thousands, or even tens of thousands of pages.

Pages within a space usually form a hierarchy. This means there are parent pages, child pages, and leaves, like a tree:

A Confluence space contains many pages

Access to pages can be restricted at each level of the tree, which also affects all child pages. If a user is not allowed to access a parent page, they cannot access any of its child pages either.
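The inheritance rule can be sketched in a few lines. This is a simplified model, not Confluence's actual implementation: it only knows view restrictions for individual users (no groups, no edit restrictions), and the names Page and can_view are made up for illustration.

```python
from dataclasses import dataclass, field
from typing import Optional, Set


@dataclass
class Page:
    title: str
    parent: Optional["Page"] = None
    # An empty set means "no view restriction on this page".
    allowed_viewers: Set[str] = field(default_factory=set)


def can_view(user, page):
    """A user can view a page only if no page on the path to the root excludes them."""
    current = page
    while current is not None:
        if current.allowed_viewers and user not in current.allowed_viewers:
            return False
        current = current.parent
    return True


root = Page("Home")
hr = Page("HR", parent=root, allowed_viewers={"alice"})
salaries = Page("Salaries", parent=hr)

print(can_view("alice", salaries))  # True
print(can_view("bob", salaries))    # False: blocked by the restriction on "HR"
print(can_view("bob", root))        # True
```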

Confluence can have many spaces. A typical instance has dozens or a couple of hundred spaces, but there are also instances with a couple of thousand spaces.

The following parts make up a Confluence space:

Pages
Metadata
- Title
- Description
- Archived or not
Restrictions

Spaces have no hierarchy:

Confluence organizes content in spaces

SharePoint

While Confluence consists of spaces, SharePoint consists of sites, which serve a similar purpose.

The following parts make up a SharePoint site:

Lists (note: those are like a table)
- List Items
  - Item Data in List Columns
  - List Item Permissions
- List Permissions
Document Libraries (note: those are like a drive or folder)
- Files
  - File Metadata in List Columns
  - File Permissions
- Document Library Permissions
Site Permissions
Apps

A SharePoint site

Some SharePoint document libraries are “special” in that they serve a specific purpose. With regard to pages, there’s the Site Pages library and the Site Assets library.

All SharePoint pages are stored in the Site Pages library. For SharePoint, pages are just files with some metadata. There is no page hierarchy and all pages are stored flat in the Site Pages library.

Page attachments are stored in the Site Assets library, in a folder that belongs to the page. Each page has its own folder.

When setting the permissions for a SharePoint page, SharePoint takes care of setting those permissions on the page’s folder as well.

How does it map?

Often Confluence spaces are compared to SharePoint sites.

I’d say this is a fair comparison, when focusing on wiki functionality.

Confluence spaces and SharePoint sites have some things in common. Both:

have a title and description
have owners
can be access restricted to users and groups
contain wiki pages and files
have their content indexed by search
have no strict technical hierarchy (bear with me on this one)
have their permissions inherited by pages, but pages can also have their own

Confluence pages and SharePoint pages also have some things in common. Both:

support the basic text formatting capabilities, like bold, headings, tables, images, etc.
can be created from templates
can have sections to structure page content
can contain “blocks of functionality” - macros in Confluence, web parts in SharePoint

Here are the areas where Confluence and SharePoint differ:

Confluence	SharePoint Online
pages form a hierarchy (with parent and child pages)	pages are flat, without a hierarchy
page breadcrumb navigation above pages	no page breadcrumb navigation
pages have attachments	pages have associated files in a special folder (we can make it look like attachments using a web part)
attachments (files) are bound to pages	files are stored in document libraries and linked to by pages
pages seem a bit better suited for documenting knowledge	pages seem a bit too much focused on presenting content nicely
macros can be nested	web parts cannot be nested
page restrictions have a hierarchy	item level permissions for pages have no hierarchy
rich app marketplace	marketplaces are not Microsoft’s strength
pages support @-mentioning other users	no @-mention support on pages
comments under pages with rich formatting, deeply nested	plain text comments under pages, 2 levels
inline comments on pages	no inline comments
inline tasks on pages	no inline tasks
integration with Jira	no integration with Jira
pages can easily be moved to other Confluence spaces (due to their self-contained nature)	pages cannot easily be moved to other SharePoint sites (at least not without breaking links or attached metadata)

Confluence Cloud introduces some new restrictions compared to Confluence Server and Data Center, for example with regard to formatting content and nesting macros.

Confluence Link Types Explained

This post shows the different link types in Confluence and highlights the difference between soft links and hard links. It covers the relevance of knowing link types and variations for migration tools.

By Heinrich Ulbricht | Thursday, October 24, 2024

Migration tools like WikiTraccs need to take care of Confluence links, because otherwise those are broken after a migration.

WikiTraccs will translate links within Confluence to proper SharePoint links. This includes page links, space links, and attachment links.

There is one challenge for migration tools when analyzing Confluence links: properly detecting both link types (soft links and hard links) as well as the different hard link variations.

What’s the difference between soft links and hard links?

I get asked that quite a lot. From an end user perspective the difference normally is not visible.

Here is an example of one soft link and one hard link, both linking to another Confluence page, as seen by an end user:

Can you spot the difference?

The difference can only be seen when looking at the page’s storage format, which looks like this:

<ac:link>
   <ri:page ri:space-key="linktarget" ri:content-title="Get the most out of your team space"
      ri:version-at-save="1" />
   <ac:link-body>Get the most out of your team space</ac:link-body>
</ac:link>
<a
   href="//wikitransformationproject.atlassian.net/wiki/spaces/linktarget/pages/60162329/Get+the+most+out+of+your+team+space">Get
   the most out of your team space
</a>

This is how tools like WikiTraccs “see” the page when migrating it to SharePoint Online, or processing it in any other way.

The first link - a soft link - is represented as ac:link element that has metadata attached that allows WikiTraccs to look up information about this page.

The soft link contains readily available metadata, like the target space key (linktarget) and the title of the page (Get the most out of your team space). This is easy to handle for a tool like WikiTraccs. And most importantly, Confluence is able to keep this link up-to-date even if the target page changes.

The second link - a hard link - is a plain HTML link. It also points to the target page, but uses the absolute page address to do so.

The hard link doesn’t contain easily accessible page metadata. If you look closely you’ll see information like the space key, page ID, and so on; those certainly help, but there are many variations of those links, and we look at those further down. Confluence will not keep hard links up-to-date, so, if the target page changes, the link might be broken.

Note

Note that both soft and hard links are markup elements in the storage format, <ac:link> and <a> respectively.

A piece of text that looks like a link, but is neither <a>, nor <ac:link>, is not a hard link. It’s just text content that happens to look like a link. WikiTraccs won’t touch those as they might be part of some documentation and transforming them would introduce errors.
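To make the distinction concrete, here is a minimal sketch that finds both link types in a storage format fragment. It is not how WikiTraccs does it internally. It assumes the fragment is well-formed XML (real pages may contain HTML entities that a plain XML parser rejects), and the namespace URIs are placeholders that only exist to make the ac: and ri: prefixes parseable.

```python
import xml.etree.ElementTree as ET

# Placeholder namespace URIs (assumption); storage format fragments use the
# ac: and ri: prefixes without declaring them.
AC = "urn:example:ac"
RI = "urn:example:ri"


def classify_links(storage_fragment):
    """Return soft links (ac:link with ri:page) and hard links (a with href)."""
    wrapped = '<root xmlns:ac="%s" xmlns:ri="%s">%s</root>' % (AC, RI, storage_fragment)
    root = ET.fromstring(wrapped)
    links = []
    for element in root.iter():
        if element.tag == "{%s}link" % AC:
            page = element.find("{%s}page" % RI)
            if page is not None:
                links.append((
                    "soft",
                    page.get("{%s}space-key" % RI),
                    page.get("{%s}content-title" % RI),
                ))
        elif element.tag == "a" and element.get("href"):
            links.append(("hard", element.get("href")))
    return links


fragment = """
<ac:link>
   <ri:page ri:space-key="linktarget" ri:content-title="Get the most out of your team space"
      ri:version-at-save="1" />
   <ac:link-body>Get the most out of your team space</ac:link-body>
</ac:link>
<a href="//wikitransformationproject.atlassian.net/wiki/spaces/linktarget/pages/60162329/Get+the+most+out+of+your+team+space">Get
   the most out of your team space</a>
<p>https://example.com/just-text-that-looks-like-a-link</p>
"""

for link in classify_links(fragment):
    print(link)
```

The text inside the p element is not reported, because it is neither an a element nor an ac:link element.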

How are hard links created?

Let’s create one.

Ideally, there shouldn’t be hard links at all. When pasting a link onto a Confluence page, Confluence analyzes the link, detects the content it links to, and creates a proper soft link.

Nevertheless, hard links are often present in more complex content that has been copied and pasted into pages. Confluence’s hard link conversion does not always kick in.

Here’s a sample of how to fool the hard link detection and create a soft link and a hard link:

The animation above shows the following:

we copy a link to a Confluence Cloud page to the clipboard
we paste the link as plain text
Confluence recognizes that this is a hard link and converts it to a soft link
we change the settings of that soft link to look like a regular link
we paste the link a second time
we edit the second link to remove the https: part; the result is still a proper and valid hard link, but it fools Confluence’s logic and it leaves the hard link in place
we save the page

The result is a page containing one soft link and one hard link.

How is it relevant if links are hard links or soft links?

It is relevant if you migrate content from one Confluence environment into another environment.

So, it is relevant for those use cases:

Confluence Cloud site to site migrations
Confluence Data Center to Confluence Cloud migrations
Confluence to SharePoint Online migrations
…

In all those cases, links between migrated pages will be broken, as well as links from external systems, like Jira descriptions and comments.

To fix those broken links, you need to:

identify the links
modify the links to point to the new location (or redirect)

Knowing that hard links and soft links exist is crucial for identifying the links in the first place.

Ideally, migration tools cover that, so you don’t have to care. WikiTraccs covers this.

What hard link variations are there?

We used a very simple hard link example so far (from Confluence Cloud):

wiki/spaces/linktarget/pages/60162329/Get+the+most+out+of+your+team+space

This link contains lots of metadata to look up the target content:

content type (page)
content ID (60162329)
space key or alias (linktarget)

Here’s a list of other hard link variations:

wiki/display/Test (space link)
wiki/spaces/Test (space link, cloud)
wiki/pages/viewpage.action?pageId=10162329 (page link)
wiki/pages/viewpage.action?pageId=10162329&pageVersion=1 (link to page version)
wiki/x/xYBtAQ (tiny page link)
wiki/display/Test/news (page link)
wiki/spaces/Test/blog/2024/10/24/181542831/news (blog post link, cloud)
wiki/spaces/Test/blog/181542831 (blog post link, cloud)
wiki/spaces/Test/pages/edit-v2/2123259301 (page edit link, cloud)
wiki/pages/viewpageattachments.action?pageId=60162247&preview=/60162247/174891441/image.png (attachment view link)
wiki/download/attachments/10162247/image.png (attachment download link)
wiki/spaces/Test/overview?homepageId=3020326337 (page link)
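A tool has to recognize each of these variations before it can look up the target content. The following sketch classifies the variations listed above with regular expressions. The patterns are derived only from this list, expect the relative form starting with wiki/, and are certainly not exhaustive; the names HARD_LINK_PATTERNS and classify_hard_link are made up for illustration.

```python
import re

# Order matters: more specific patterns come before more general ones.
HARD_LINK_PATTERNS = [
    ("tiny page link", r"wiki/x/(?P<tiny>[\w-]+)$"),
    ("attachment view link", r"wiki/pages/viewpageattachments\.action\?pageId=(?P<id>\d+)"),
    ("attachment download link", r"wiki/download/attachments/(?P<id>\d+)/(?P<file>[^/?]+)"),
    ("link to page version", r"wiki/pages/viewpage\.action\?pageId=(?P<id>\d+)&pageVersion=(?P<version>\d+)"),
    ("page link", r"wiki/pages/viewpage\.action\?pageId=(?P<id>\d+)"),
    ("page edit link", r"wiki/spaces/(?P<space>[^/?]+)/pages/edit-v2/(?P<id>\d+)"),
    ("page link", r"wiki/spaces/(?P<space>[^/?]+)/pages/(?P<id>\d+)"),
    ("blog post link", r"wiki/spaces/(?P<space>[^/?]+)/blog/(?:\d{4}/\d{2}/\d{2}/)?(?P<id>\d+)"),
    ("page link", r"wiki/spaces/(?P<space>[^/?]+)/overview\?homepageId=(?P<id>\d+)"),
    ("space link", r"wiki/spaces/(?P<space>[^/?]+)/?$"),
    ("page link", r"wiki/display/(?P<space>[^/?]+)/(?P<title>[^/?]+)$"),
    ("space link", r"wiki/display/(?P<space>[^/?]+)/?$"),
]


def classify_hard_link(link):
    """Return (kind, extracted metadata) for a relative hard link, else None."""
    for kind, pattern in HARD_LINK_PATTERNS:
        match = re.match(pattern, link)
        if match:
            return kind, match.groupdict()
    return None


print(classify_hard_link("wiki/spaces/Test/blog/2024/10/24/181542831/news"))
# ('blog post link', {'space': 'Test', 'id': '181542831'})
```

Note how little metadata some variations carry: a tiny link exposes neither space key nor page ID in readable form, and a wiki/display link only carries the page title.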

Wrap

In this post we learned about soft links and hard links, and looked at different hard link variations.

When moving content from one Confluence environment to another environment (Confluence, SharePoint Online, …) links to pages, attachments, and spaces will break. Those links need to be taken care of, either by tools like WikiTraccs, or proxy solutions that recognize those links and redirect to the new location.

Mapping user accounts from Confluence to SharePoint

This blog post shows a basic user mapping.

By Heinrich Ulbricht | Monday, October 14, 2024

In this blog post we’ll create a Confluence page, look at author and editor, migrate the page, configure user mappings, look at the resulting SharePoint page, and update the page’s metadata - to demonstrate how to map users from Confluence to SharePoint.

We start with a single Confluence source page.

Confluence Source Page

This Confluence page was created by Admin, and last edited by Parzival.

Here are those users in the Confluence administration:

Those users are local Confluence users, so not synchronized with Entra ID or other external directories.

However, note that the email address of Parzival corresponds to the email address of their Entra ID user. We’ll come back to that fact at a later point.

Migrating the Source Page to SharePoint

We now migrate the page to SharePoint.

When migrating pages, WikiTraccs looks at each page and “collects” user accounts from metadata fields and page restrictions.

Information about those user accounts will be stored in the Confluence User and Group Mapping (WikiTraccs) table, in the WikiTraccs site in SharePoint:

This list contains information like:

Principal Type (user or group)
Display Name
Internal Name
Key
Email Address (note: only if one could be retrieved)
Principal Source (note: the source is either onprem for Confluence Server and Data Center, or cloud for Confluence Cloud; it does not refer to specific user directories or their location)

The column WT_Setting_MapForDataAndMentions is where the corresponding Entra ID account has to be entered.

Filling in the WT_Setting_MapForDataAndMentions column is a manual task, but can also be automated via PowerShell. Clients also had success filling that via Excel.

There is an automapping feature built into WikiTraccs, though. WikiTraccs tries to find a matching Entra ID account based on the Confluence account’s email address.

That is why (in above screenshot) the mapping for Parzival is already filled in; this account has the same email address in Confluence and Entra ID.

For all other accounts the mapping has to be filled in manually (by you).
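The idea behind automapping can be sketched as a lookup by email address. This is an illustration of the principle only, not WikiTraccs code: the dictionary keys (key, email, login), the sample accounts, and the case-insensitive comparison are assumptions.

```python
def _normalize(email):
    return (email or "").strip().lower()


def automap_by_email(confluence_accounts, entra_accounts):
    """Return {Confluence account key: Entra ID login} for matching email addresses."""
    entra_by_email = {}
    for account in entra_accounts:
        email = _normalize(account.get("email"))
        if email:
            entra_by_email[email] = account["login"]

    mapping = {}
    for account in confluence_accounts:
        email = _normalize(account.get("email"))
        # Accounts without an email address cannot be automapped.
        if email and email in entra_by_email:
            mapping[account["key"]] = entra_by_email[email]
    return mapping


confluence_accounts = [
    {"key": "admin", "email": None},
    {"key": "parzival", "email": "Parzival@contoso.com"},
]
entra_accounts = [
    {"login": "parzi.val@contoso.com", "email": "parzival@contoso.com"},
]

print(automap_by_email(confluence_accounts, entra_accounts))
# {'parzival': 'parzi.val@contoso.com'}
```

The admin account stays unmapped in this sample, which mirrors the situation in this post where the mapping for Admin has to be entered manually.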

Looking at the Migration Result

What we’ve got so far is:

one page migrated from Confluence to SharePoint
the page author in Confluence is Admin, page editor is Parzival
the user mapping for Parzival has been automapped by WikiTraccs in the Confluence User and Group Mapping (WikiTraccs) table
the user mapping for Admin is yet to be done
the migration account used to log in to SharePoint was WikiTraccs

The metadata of the resulting SharePoint page will look like this:

The creator of the SharePoint page is the Entra ID account WikiTraccs, not Admin. That is because there is no mapping configured for Admin, yet. So, the creator is set to the account doing the migration.

The editor of the SharePoint page is set to Parzi Val, which is the Entra ID account that matches (via email address) the Confluence account Parzival.

We now want to update the mapping for Admin.

Updating Mappings, Updating Pages

So far everything is going as planned.

We migrated some content and got a mapping to do.

Let’s configure the mapping for the Confluence Admin account which we map to the Heinrich Entra ID account:

But how to update the SharePoint page?

WikiTraccs has an update mode built-in for this specific case. In the blue WikiTraccs window, choose it in the Settings dialog:

Update mode that updates page metadata based on configured account mappings

Leave all other settings in place and start the migration again.

WikiTraccs will now run over all migrated pages where a mapping was missing and update the page metadata.

How does WikiTraccs know where a mapping was missing? Those pages have been marked via the Check Principal Mapping column, in the Site Pages library:

Where to find the Check Principal Mapping column?

Open your target SharePoint site in a browser. Click the Settings cog wheel in the upper right corner, then click Site Contents:

A list of the site’s lists and document libraries opens. It contains the Site Pages document library, where all pages are stored.

Click Site Pages (or the translated version like Websiteseiten) to open the library.

The Site Pages library default view does not show the Check Principal Mapping column, yet. Let’s show it.

First, switch to the Recent Pages (WikiTraccs) view:

Now, in the header row of the document library, click + Add column, then Show or hide columns, then find and check the WT: Check Principal Mapping (WikiTraccs) column:

Click Apply to save and close. The column is now visible and can be used.

After successfully updating author and editor metadata of a page (and only then), WikiTraccs removes this check mark. (Note: If you ever want to force an update of author and editor, set the check mark again before running the migration in update mode.)
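The interplay of mapping, fallback account, and check mark can be summarized in a short sketch. It models pages as plain dictionaries with hypothetical field names and treats "successfully updated" as "both creator and editor could be mapped", which is an assumption about the behavior described above, not a look into WikiTraccs internals.

```python
def resolve_account(confluence_user, mapping, migration_account):
    """Return (target account, was a mapping found?), falling back to the migration account."""
    target = mapping.get(confluence_user)
    if target:
        return target, True
    return migration_account, False


def run_update_mode(pages, mapping, migration_account):
    """Update creator and editor of all pages that are marked for a mapping check."""
    for page in pages:
        if not page["check_principal_mapping"]:
            continue  # no mapping was missing for this page
        creator, creator_ok = resolve_account(page["source_creator"], mapping, migration_account)
        editor, editor_ok = resolve_account(page["source_editor"], mapping, migration_account)
        page["creator"], page["editor"] = creator, editor
        # Assumption: the check mark is removed only when both accounts were mapped.
        page["check_principal_mapping"] = not (creator_ok and editor_ok)
    return pages


pages = [{
    "source_creator": "Admin",
    "source_editor": "Parzival",
    "creator": "WikiTraccs",
    "editor": "Parzi Val",
    "check_principal_mapping": True,
}]
mapping = {"Admin": "Heinrich", "Parzival": "Parzi Val"}

run_update_mode(pages, mapping, migration_account="WikiTraccs")
print(pages[0]["creator"], pages[0]["check_principal_mapping"])  # Heinrich False
```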

After WikiTraccs is finished with the update mode run the SharePoint page should now look like this:

Creator and editor are properly set to what was configured via the Confluence User and Group Mapping (WikiTraccs) table.

Confluence Authentication Overview

This post summarizes the different Confluence authentication options that WikiTraccs supports, with a focus on navigating some harder cases.

By Heinrich Ulbricht | Sunday, August 25, 2024

When starting a Confluence to SharePoint migration with WikiTraccs, you need to authenticate with Confluence so that WikiTraccs can access content.

You’ll need a Confluence user account for the migration

You’ll need a Confluence user account to log in with (“migration account”). WikiTraccs’ access to Confluence happens as this user.

WikiTraccs does not support “application permissions” where it would access Confluence as “itself”. WikiTraccs will always access Confluence in the context of a user account.

Anonymous access

There is one exception to the rule that you always need to use a migration account, and that is a Confluence instance with anonymous access enabled.

Anonymous access is good for testing or demo purposes. The quick start tutorial also uses that with a publicly available Confluence instance.

Anonymous access is not recommended for production migrations as certain metadata cannot be retrieved, like information about users (name, email address) and permission configuration.

Interactive Login is the easiest and most compatible authentication type.

WikiTraccs opens a browser window that you’ll use to log in to Confluence with the migration account. WikiTraccs takes over the user session and all requests to Confluence will be made in the context of the migration user account.

Here is more information: Cookie-based authentication / Interactive login.

The session cookies that WikiTraccs uses to take over the user session are:

JSESSIONID
seraph.confluence
tenant.session.token (Confluence Cloud)
_shibsession* (Shibboleth identity provider)
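Conceptually, taking over the session means picking the relevant cookies out of everything the browser holds. Here is a minimal sketch of that filtering step. It assumes that the trailing * in _shibsession* acts as a wildcard, models cookies as a plain dictionary, and uses the hypothetical name select_session_cookies; how WikiTraccs handles this internally may differ.

```python
from fnmatch import fnmatchcase

# Cookie names from the list above; the trailing * is treated as a wildcard (assumption).
SESSION_COOKIE_PATTERNS = [
    "JSESSIONID",
    "seraph.confluence",
    "tenant.session.token",
    "_shibsession*",
]


def select_session_cookies(cookies, extra_patterns=()):
    """Keep only cookies whose names match a known or additionally configured pattern."""
    patterns = list(SESSION_COOKIE_PATTERNS) + list(extra_patterns)
    return {
        name: value
        for name, value in cookies.items()
        if any(fnmatchcase(name, pattern) for pattern in patterns)
    }


browser_cookies = {
    "JSESSIONID": "abc",
    "_shibsession_1234": "def",
    "theme": "dark",
    "MY_IDP_SESSION": "ghi",
}

print(select_session_cookies(browser_cookies))
# {'JSESSIONID': 'abc', '_shibsession_1234': 'def'}
print(select_session_cookies(browser_cookies, extra_patterns=["MY_IDP_SESSION"]))
# {'JSESSIONID': 'abc', '_shibsession_1234': 'def', 'MY_IDP_SESSION': 'ghi'}
```

The extra_patterns parameter corresponds to the idea of entering additional cookie names for identity providers that need them.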

In case your identity provider needs additional cookies to identify a logged in user, click the Advanced Settings button and enter those cookie names:

If there are issues with interactive login, like WikiTraccs not being able to start the Chrome browser for interactive login, refer to the troubleshooting article dedicated to getting around those issues: Confluence Authentication Issues.

Personal Token

The Personal Token authentication type uses a special kind of password (“token” or “key”) to log in. This password is created in Confluence in the context of a Confluence user account.

Please refer to this article for configuration details: Personal Access Token.

Confluence Server and Data Center just need the token

Confluence Cloud needs the name as well

The Personal Token is also referred to as Personal Access Token, API Token, or API Key.
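The reason Confluence Cloud needs the name as well becomes clearer when looking at how such tokens are usually sent with REST API requests. The sketch below follows the general Atlassian conventions (bearer token for Server and Data Center, HTTP Basic authentication with account name and API token for Cloud); it is not taken from WikiTraccs, and the function names and sample values are placeholders.

```python
import base64


def server_auth_header(token):
    """Confluence Server / Data Center: the personal access token is sent as bearer token."""
    return {"Authorization": f"Bearer {token}"}


def cloud_auth_header(account_name, api_token):
    """Confluence Cloud: account name and API token go into HTTP Basic authentication."""
    raw = f"{account_name}:{api_token}".encode("utf-8")
    return {"Authorization": "Basic " + base64.b64encode(raw).decode("ascii")}


# Placeholder values, not real credentials.
print(server_auth_header("PLACEHOLDER_TOKEN"))
print(cloud_auth_header("migration@contoso.com", "PLACEHOLDER_TOKEN"))
```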

Confluence 7.9 started supporting this authentication type.

In Confluence Cloud some transformations may not be available with this authentication type (Jira Issue List).

The harder cases…

What are hard cases with regard to Confluence authentication?

Here are some hallmarks of a hard case:

Interactive Login authentication does not work immediately
Personal Token authentication is not available because Confluence is older than version 7.9 or because of multi-factor or other requirements
the cookies.txt workaround to provide session cookies manually for Interactive Login does not work

Over time, I encountered the following hard cases:

Kerberos prevented WikiTraccs from taking over the session cookies from Interactive Login; only connections made from the Chrome browser were considered valid by the Confluence backend
WikiTraccs once could not detect a successful login to Confluence and was thus not able to get the session cookies for Interactive Login; cookies.txt also did not work; reason unknown
Confluence could only be connected to using TLS 1.3-secured connections; Windows and the Confluence backend could not negotiate a mutual TLS 1.3 cipher to use; this issue does not affect the Chrome browser as Chrome brings its own cryptographic stack that operates independently of Windows

Note that those cases were the exception and not the rule.

Nevertheless, those cases needed to be handled, as some of them were introduced by configuration changes made during a migration and were thus blockers. So, WikiTraccs got something called the proxy mode.

Proxy Mode

The proxy mode is a workaround for all of the above issues with the Interactive Login authentication type.

Normally, connection and authentication issues tend to be absent when a user logs in to Confluence in the browser, but start appearing once WikiTraccs starts talking to Confluence directly.

If those issues cannot be resolved, there is only one way left: go back to using the browser to connect to Confluence.

That’s what the proxy mode does. WikiTraccs will keep an automated Chrome browser window open and route all requests to Confluence right through that browser.

This comes with some drawbacks as well. Routing everything through the browser is slower. Also, the proxy mode is not perfectly polished, as it is only meant as a workaround. So WikiTraccs might get confused at times when automating the Chrome browser window, which can be resolved by closing and reopening all windows.

But overall the proxy mode is something of a silver bullet for authentication issues.

You activate the proxy mode in WikiTraccs Settings -> Misc -> Proxy Confluence API calls through browser:

Summary

When it comes to authenticating with Confluence those are the steps to take:

In WikiTraccs:

Choose the Interactive Login authentication type and test connecting; if that doesn’t work immediately:
Choose the Personal Token authentication type and test connecting; if that isn’t available:
Use the cookies.txt workaround and test connecting using the Interactive Login;
- if that does work: good; find the missing cookie names and enter them in the Advanced Settings dialog
- if that doesn’t work:
Use the Proxy Mode in combination with Interactive Login

If your environment blocks connections to certain endpoints, it might also be necessary to provide the Chrome Driver manually. WikiTraccs uses the Chrome Driver to automate the Chrome browser.

General considerations:

check if the migration account can be enabled for Personal Token authentication
make sure all endpoints can be reached by WikiTraccs, e.g. to download the Chrome driver needed to automate the Chrome browser for Interactive Login
make sure the Confluence backend and Windows can work together with regard to supported TLS versions and ciphers

Usually, none of the workarounds are necessary. But now and then they are needed. That’s why they are available.

And if nothing works - get in touch and we’ll have a look.

Confluence might misreport space contents

This post highlights a challenge some Confluence instances pose - they misreport the content of spaces.

By Heinrich Ulbricht | Thursday, August 22, 2024

There is one specific issue that customers have now and then: not all pages of a space are being migrated. Some pages will be missing in SharePoint.

Why is that?

While there can be multiple reasons, one is Confluence itself misreporting the contents of spaces.

Let’s look at the root issue and how WikiTraccs tries to work around it.

The root issue: Confluence lies about its space contents

If the Confluence issue is present, it affects the most convenient type of WikiTraccs’ source content selectors, the Space Selector.

When WikiTraccs starts migrating a space it asks Confluence about the contents of this space, like “give me all page IDs in this space”.

Confluence will happily answer and the list of page IDs it returns might look like this: 00001, 00002, 00003, 00004, 00005, 00006. This would be 6 pages to migrate. That’s what we expect.

But sometimes the result looks different, although all 6 pages are definitely there. Confluence might report the list of page IDs like this: 00001, 00002, 00002, 00002, 00005, 00006.

Notice the difference? Page ID 00002 is listed three times, while 00003 and 00004 are missing.

This is a problem. Why is Confluence lying to us? I don’t know.

Note

Over the course of two years I got the impression that at least 10% of Confluence on-premises instances are affected. Not sure about cloud, yet.

WikiTraccs tries to work around this issue

The latest release of WikiTraccs contains a workaround for this Confluence issue.

WikiTraccs detects duplicate page IDs and will take that as a hint that page IDs will be missing as well. It will then use a different method to retrieve the page IDs for a space.
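The detection step itself is simple, as this sketch shows. It uses the sample ID list from above; the function name find_duplicate_ids is hypothetical and the code only illustrates the idea, not the actual WikiTraccs implementation.

```python
from collections import Counter


def find_duplicate_ids(page_ids):
    """Return all page IDs that were reported more than once."""
    counts = Counter(page_ids)
    return sorted(page_id for page_id, count in counts.items() if count > 1)


reported = ["00001", "00002", "00002", "00002", "00005", "00006"]
duplicates = find_duplicate_ids(reported)
if duplicates:
    # Duplicates hint at missing IDs, so a different retrieval method is needed.
    print("duplicates found:", duplicates)  # duplicates found: ['00002']
```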

One caveat of this workaround is that it’s significantly slower than just getting the list of page IDs handed by Confluence. But at least it should retrieve a complete list.

How to verify that all pages have been migrated?

When Confluence is not lying to us, WikiTraccs’ progress log files are the way to go. The __30-aggregated-info file shows a summary of a space’s migration progress. This is the happy path.

If you have a hunch (or see it in the logs) that Confluence might be lying about the contents of a space, your only chance is to look at and compare with the Confluence database.

Here’s how to get a list of page IDs from the Confluence database, for a given space: Getting a list of pages per space.

Compare the list of page IDs you got from the database with the list of pages WikiTraccs got handed by Confluence. The pages WikiTraccs knows about for a space can be seen in the __25-update-state-of-migrated-pages progress log file.
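Once both lists are available, for example as exported text files with one ID per line, the comparison is a set difference. A minimal sketch, with the hypothetical function name compare_page_ids and the sample IDs used earlier in this post:

```python
def compare_page_ids(database_ids, reported_ids):
    """Compare the IDs from the database with the IDs Confluence reported for a space."""
    database_ids, reported_ids = set(database_ids), set(reported_ids)
    return {
        "missing": sorted(database_ids - reported_ids),     # in the database, never reported
        "unexpected": sorted(reported_ids - database_ids),  # reported, but not in the database
    }


from_database = ["00001", "00002", "00003", "00004", "00005", "00006"]
from_confluence = ["00001", "00002", "00002", "00002", "00005", "00006"]

print(compare_page_ids(from_database, from_confluence))
# {'missing': ['00003', '00004'], 'unexpected': []}
```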

Hints about duplicates in the WikiTraccs log file

The WikiTraccs common log files contain information about duplicate page detection and applied workarounds.

You want to find this message for each space that has been selected for migration:

No duplicate IDs found for selector (Type=ConfluenceSpaceKey; Query=GOOD)

The log message above says that all is fine for the space with key GOOD, as there were no duplicate page IDs. Also, WikiTraccs takes this as a hint that there won’t be page IDs missing (note: this is an assumption that is yet to be proven wrong).

In contrast, the following log message indicates that duplicates were found for the space with key DUPE:

Duplicate content IDs found for selector (Type=ConfluenceSpaceKey; Query=DUPE)

Searching for the text [DUPLICATES] in the WikiTraccs log files will surface further details about the affected spaces, like which pages are affected and which pages could only be retrieved via the built-in workaround.
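With many spaces it can be handy to scan the log files with a script. The following sketch matches the two messages quoted above and reports, per selector query, whether duplicates were found. The surrounding log line layout (timestamp prefix) in the sample is an assumption; only the message texts are taken from this post.

```python
import re

SELECTOR_MESSAGE = re.compile(
    r"(?P<status>No duplicate IDs found|Duplicate content IDs found) for selector "
    r"\(Type=(?P<type>[^;]+); Query=(?P<query>[^)]+)\)"
)


def summarize_duplicate_checks(log_lines):
    """Return {selector query: True if duplicates were found, else False}."""
    result = {}
    for line in log_lines:
        match = SELECTOR_MESSAGE.search(line)
        if match:
            result[match.group("query")] = match.group("status").startswith("Duplicate")
    return result


# Sample lines; the timestamp prefix is made up for illustration.
sample_log = [
    "2024-08-22 10:00:00 No duplicate IDs found for selector (Type=ConfluenceSpaceKey; Query=GOOD)",
    "2024-08-22 10:00:05 Duplicate content IDs found for selector (Type=ConfluenceSpaceKey; Query=DUPE)",
    "2024-08-22 10:00:06 some unrelated message",
]

print(summarize_duplicate_checks(sample_log))
# {'GOOD': False, 'DUPE': True}
```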

Working around the issue with the Content ID Selector

The issue only affects Space Selectors, because for those WikiTraccs asks Confluence about the space’s contents. Consequently Confluence might choose to lie to us.

To prevent this kind of issue, you might choose Content ID Selectors instead to tell WikiTraccs the page IDs it should migrate. With this type of selector you take the “page ID retrieval” part into your own hands. Have a look at the documentation for details.

In general, Space Selectors are easier to handle than Content ID Selectors. So in an ideal world, there would be no need to choose one over the other to work around Confluence issues.

How to fix this Confluence issue?

I don’t know. Let me know if you find a solution.

Can feature 'XYZ' be added to WikiTraccs?

This post outlines how I add new features to WikiTraccs and how we can help each other.

By Heinrich Ulbricht | Monday, August 19, 2024

It depends.

Adding XYZ to WikiTraccs vs. project-specific solutions

The development of WikiTraccs is mostly driven by customer demand. That, and changes by Microsoft and Atlassian to their services.

Every environment and migration project is different, but there are similar challenges. If customers (or I) face the same challenge over and over again, it’s usually time to think about extending WikiTraccs to support the underlying use cases.

New WikiTraccs feature?

I’m open to extending WikiTraccs if the new feature helps with an often-faced challenge that is not isolated to a specific migration project.

New WikiTraccs features must:

✅ work in a standard SharePoint Online environment, without having to install third-party apps
✅ solve a challenge for multiple clients
✅ have a technical solution
✅ take reasonable time to implement

If some of those boxes cannot be checked then the solution is usually project-specific, although we should definitely talk this through.

Project-specific solution?

Project-specific solutions solve challenges for a specific migration project or a specific environment.

Modifying the page header to comply with CI/CD? Turn off the comments section on pages? Add page metadata for HR? Those are unique requirements within a project.

Project-specific solutions usually involve a combination of configuration (search configuration, content types, managed metadata), low-code solutions (Power Automate workflows), and scripting (PowerShell, JavaScript).

Missing feature XYZ is a blocker for me - can you add it to WikiTraccs?

I’d like to learn more. Does the feature check all of the above boxes? That’s a good start.

The less time it takes to implement a feature, the more likely it is that I’ll add it for you. The more time it takes, the more useful it has to be in a broader sense.

I consider anything below ~2 days of end-to-end development effort as relatively low effort. Adding transformations for yet-unknown but simple-to-handle macros falls into that category. So do adjustments to text placeholders.

Note that I cannot guarantee that, or when, a feature will be added.

That being said - I’m always open to feature proposals as this is the way for me to learn about real-world challenges.

If we haven’t already been in touch via a demo or email or GitHub, please get in touch!

Help me help you

When looking at your Confluence to SharePoint migration challenge, I need to understand it first. Please help me with that.

Screenshots help illustrate your case

Screenshots are a good way to start understanding migration results.

Something looks unexpected after migrating pages to SharePoint? Take a screenshot of the source Confluence page and a screenshot of the migrated page in SharePoint. Highlight your expectations in the screenshots or write some lines about what you expect.

Create the expected outcome manually in SharePoint

Help me see the SharePoint page as you envision it to look after the migration.

Create a new modern SharePoint page and edit it. Manually add all the necessary elements to make it resemble the source Confluence page as closely as possible. Use only out-of-the-box SharePoint web parts - no PnP, no third-party tools (I’d nevertheless like to hear about those as well! Maybe yours is the exception to the rule).

Send me a screenshot of both the source Confluence page and target SharePoint page.

This ensures that it is technically possible to achieve what you’d like to see on the SharePoint side and serves as a mockup for me to understand your requirements.

Furthermore, it would help a great deal if you could send me the SharePoint storage format of the page. Here’s how to get that: Get the SharePoint Storage Format

Send me the Confluence page’s storage format XML

There is a Confluence page that contains something that you’d like to see migrated differently? Like a macro that ends up being a placeholder in SharePoint?

Send me a screenshot of the Confluence page.

I also need to see the page the way WikiTraccs sees it to estimate how much effort it takes to support new transformations.

Send me the storage format XML. Here’s how to get that: Get the Confluence Storage Format.

The storage format XML contains a page’s text content, structure, formatting, and most importantly information about all macros, including their parameters.
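To give an idea of what that macro information looks like, here is a minimal Python sketch that lists macros and their parameters from a storage format snippet. It is an illustration only, not how WikiTraccs reads pages. It assumes that macros appear as ac:structured-macro elements with ac:parameter children. The namespace URIs are placeholders made up for this example, because storage format snippets use the ac: and ri: prefixes without declaring them. Real pages can also contain HTML entities such as &nbsp; that this simple parser would reject.

```python
import xml.etree.ElementTree as ET

# Placeholder namespace URIs (hypothetical): the snippet itself does not declare
# the "ac" and "ri" prefixes, so they are declared on a wrapper element.
AC = "urn:example:ac"
RI = "urn:example:ri"


def list_macros(storage_xml: str) -> list[tuple[str, dict[str, str]]]:
    """Return (macro name, parameters) for every structured macro in the snippet."""
    wrapped = f'<root xmlns:ac="{AC}" xmlns:ri="{RI}">{storage_xml}</root>'
    root = ET.fromstring(wrapped)
    macros = []
    for macro in root.iter(f"{{{AC}}}structured-macro"):
        name = macro.get(f"{{{AC}}}name", "")
        # Only direct child parameters belong to this macro
        params = {
            p.get(f"{{{AC}}}name", ""): (p.text or "")
            for p in macro.findall(f"{{{AC}}}parameter")
        }
        macros.append((name, params))
    return macros


sample = (
    "<p>Intro</p>"
    '<ac:structured-macro ac:name="info">'
    '<ac:parameter ac:name="title">Heads up</ac:parameter>'
    "<ac:rich-text-body><p>Body</p></ac:rich-text-body>"
    "</ac:structured-macro>"
)
print(list_macros(sample))
```

A list like this makes it easy to see which macros and parameters a page uses before discussing how they should be transformed.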

Closing note

Please review the existing feature proposals in the GitHub issue list and vote for those you’d like to see gain traction. Alternatively, you can create a new feature proposal or start a discussion.

Converting Gliffy and draw.io to SVG

This blog post shows how to convert popular diagramming solutions to SVG.

By Heinrich Ulbricht | Monday, July 22, 2024

Motivation

The motivation behind this blog post is a customer request:

When migrating from Confluence to SharePoint, how to migrate Gliffy diagrams as SVG files?

Unfortunately, WikiTraccs cannot do much in this regard. WikiTraccs migrates Confluence page attachments, which include the diagram file (from Gliffy or draw.io) and the diagram’s preview image. Read the previous blog post for details on that.

At least you end up having the diagram file in SharePoint Online.

Having the diagram file in SharePoint is a good thing, because it gives us the option to work with those diagrams even when Confluence is not around anymore.

How to export diagrams as SVG files

Let’s look at our options to export diagrams as SVG.

Gliffy => SVG

When Confluence is still around, you can export from Confluence:

Open Confluence page
Edit Gliffy diagram
File… > Export > SVG

After having migrated to SharePoint Online, you need to work with the diagram file instead:

Open a migrated SharePoint page
Download the migrated Gliffy diagram file from the page’s attachments
Open app.diagrams.net (a free online diagram software) in browser

Warning

The next step uploads your diagram file to app.diagrams.net.

Drag the diagram file into the browser
File > Export as > SVG…

draw.io => SVG

When Confluence is still around, you can export from Confluence:

Open Confluence page
Edit draw.io diagram
File > Export as > SVG…

After having migrated to SharePoint Online, you need to work with the diagram file instead:

Open migrated SharePoint page
Download migrated draw.io diagram file from the page’s attachments
Open app.diagrams.net?offline=1 in browser (a free online diagram software), in offline mode
Drag the diagram file into the browser
File > Export as > SVG…

Note that this doesn’t seem to upload the diagram file as it already has the draw.io format.

Privacy Considerations

When Confluence is not around anymore and you end up using app.diagrams.net (the only free tool I know of that can convert Gliffy to draw.io), there is one fundamental difference between Gliffy and draw.io files.

The app.diagrams.net online editor needs to convert Gliffy diagrams to draw.io diagrams first. This is done at a remote location. This means your diagram files will be uploaded to a third party.

This is contrary to draw.io diagram files. When loading those into the app.diagrams.net online editor, no upload seems to happen.

draw.io Desktop (free)

There is an alternative to app.diagrams.net and that is the draw.io Desktop application. You can download it at drawio.com.

draw.io Desktop seems to have the same capabilities as app.diagrams.net. There is one important exception: the desktop version cannot convert Gliffy files to the draw.io format. That is unfortunate when you only have Gliffy files that you want to convert to SVG.

There is no free offline tool available that I know of that can convert from Gliffy to draw.io.
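For draw.io files (not Gliffy), the desktop app can also be scripted. The following Python sketch builds the command line for exporting one diagram file to SVG. It is not part of WikiTraccs, and it rests on assumptions: that the draw.io Desktop executable is reachable as drawio (the name differs per platform), and that it accepts the --export, --format and --output options as I understand its command-line interface. Check the output of its --help option before relying on this.

```python
from pathlib import Path


def build_export_command(diagram_path: str, executable: str = "drawio") -> list[str]:
    """Build the argument list for exporting one draw.io file to SVG.

    The executable name and the option names are assumptions about the
    draw.io Desktop command line; verify them with "--help" first.
    """
    diagram = Path(diagram_path)
    target = diagram.with_suffix(".svg")  # same folder, .svg extension
    return [
        executable,
        "--export",
        "--format", "svg",
        "--output", str(target),
        str(diagram),
    ]


command = build_export_command("network-overview.drawio")
print(" ".join(command))
# After reviewing the command, it could be run with:
# subprocess.run(command, check=True)
```

Building the command separately from running it makes it possible to review what would be executed for a whole folder of downloaded attachments before converting anything.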

Gliffy-to-draw.io Mass Conversion (paid-for)

There is a paid option available to convert Gliffy to draw.io when Confluence is still around.

The draw.io Confluence plugin can convert Gliffy to draw.io.

According to its documentation, it can do mass conversions of all Gliffy diagrams in a Confluence instance:

Also according to the documentation, it doesn’t require a license to do the actual conversion. But it will convert all Gliffy macros to draw.io macros as well, so having no license doesn’t seem to make sense.

Mass-converting Gliffy to draw.io before migrating to SharePoint seems like an option to have those draw.io files ready in SharePoint, in case users want to work with those diagrams using either app.diagrams.net or the draw.io Desktop app.

Migrating Gliffy and draw.io macros to SharePoint Online

WikiTraccs migrates Gliffy and Draw.io macros. This article has insights.

By Heinrich Ulbricht | Friday, July 19, 2024

We first look at how Gliffy and draw.io work in Confluence, then we look at a migration result.

How are the diagrams stored in Confluence?

This is a sample Confluence page (in edit mode) containing both the Gliffy and draw.io macro:

Both macros create two page attachments when you create a diagram.

One attachment file is the diagram file, containing the diagram definition in a macro-specific format. The other attachment is the preview image that is shown on the page.

Here’s how the page attachments for the above sample page look. It’s two attachments per macro:

How do Gliffy and draw.io look in SharePoint Online?

First of all, WikiTraccs migrates the page content and page attachments as usual. So, both the diagram file and the preview image will be available in SharePoint as well.

When migrating page contents, WikiTraccs looks out for Gliffy and draw.io, and replaces those macros with a SharePoint image web part. The image web part is configured to show the respective macro preview image:

Looking at the SharePoint page’s attachments we can see the same files as in Confluence:

This is the result you can expect after migrating Confluence pages with Gliffy or draw.io macros to SharePoint Online using WikiTraccs.

Transforming even more colors from Confluence to SharePoint!

This post describes the latest additions and changes with regard to text colors, highlight colors, and table cell colors.

By Heinrich Ulbricht | Thursday, July 18, 2024

The latest release of WikiTraccs contains multiple improvements with regard to colors.

Text colors, the newly introduced text highlight colors (Confluence Cloud) and table cell colors got some attention.

Note

SharePoint does not support all colors that are available in Confluence. So WikiTraccs tries its best to map to a color that exists in SharePoint Online.

What has SharePoint Online to offer in terms of colors?

The SharePoint Online color palette for text looks like this:

Note that we cannot really use the theme colors as they change, depending on the theme. That leaves us with 11 text colors.

The SharePoint Online color palette for highlights looks like this:

That’s a whopping 15 colors.

What about table cell colors? SharePoint does not have those. We’ll see how WikiTraccs handles that further down.

Text Color Transformation

Those are the 21 text colors available (at least at the moment) in Confluence Cloud:

Text Colors in Confluence Cloud

WikiTraccs maps those colors to the visually nearest available text color in SharePoint Online:

Text Colors in SharePoint Online

Here’s another example from Confluence Server, which has 40 colors to offer:

Text Colors in Confluence Server 6

This is how it’s transformed to SharePoint:

Text Colors in SharePoint Online

Highlight Color Transformation (new)

Now for the new highlight colors in Confluence Cloud:

Highlights in Confluence Cloud

Those map pretty well to SharePoint Online in terms of available colors:

Highlights in SharePoint Online

SharePoint’s highlight colors are way darker than in Confluence, which makes black text hard to read on the teal and purple highlights.

Table Cell Color Transformation

Here’s how table cell colors look in Confluence Cloud:

Table cell colors in Confluence Cloud

SharePoint does not support setting colors for table cells. Maybe that’s coming in the future, but so far I did not spot anything related on the roadmap. (Let me know if you see or hear something!)

WikiTraccs uses a workaround to at least indicate in SharePoint that the table cell was colored in Confluence.

"Table cell colors" in SharePoint Online

Here’s another sample from Confluence Server 6, which has fewer cell colors to offer:

Table cell colors in Confluence Server 6

This is how it’s transformed to SharePoint:

"Table cell colors" in SharePoint Online

In both instances, WikiTraccs adds a block of colored spaces to each formerly colored table cell, serving as a “color marker”. And again, it is restricted to colors that SharePoint has to offer.

What changed in the latest WikiTraccs release?

The latest release handles the text highlight colors that Atlassian introduced in June 2024.

Furthermore, the color markers for table cells now support the full range of text colors available in SharePoint. Previously, WikiTraccs used colored emoji squares like 🟩 as marker - but those only come in very few colors.

And last but not least: the algorithm to calculate the visual distance between colors has been improved. This maps Confluence colors even more closely to their SharePoint equivalents.
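To illustrate the general idea of mapping a color to its visually nearest neighbor in a limited palette, here is a minimal Python sketch. It is not the WikiTraccs algorithm, which is not described in this post. It uses a weighted RGB distance (often called the "redmean" approximation) as a stand-in, and the target palette is a made-up example, not the actual SharePoint color list.

```python
def hex_to_rgb(value: str) -> tuple[int, int, int]:
    """Convert "#RRGGBB" to an (r, g, b) tuple."""
    value = value.lstrip("#")
    return tuple(int(value[i:i + 2], 16) for i in (0, 2, 4))


def color_distance(a: tuple[int, int, int], b: tuple[int, int, int]) -> float:
    """Weighted RGB distance that roughly follows perceived color difference."""
    r_mean = (a[0] + b[0]) / 2
    dr, dg, db = a[0] - b[0], a[1] - b[1], a[2] - b[2]
    return (
        (2 + r_mean / 256) * dr * dr
        + 4 * dg * dg
        + (2 + (255 - r_mean) / 256) * db * db
    ) ** 0.5


# Hypothetical target palette for illustration only
TARGET_PALETTE = {
    "red": "#FF0000",
    "green": "#008000",
    "blue": "#0000FF",
    "yellow": "#FFFF00",
    "black": "#000000",
}


def nearest_color(source_hex: str) -> str:
    """Return the name of the palette color closest to the source color."""
    source = hex_to_rgb(source_hex)
    return min(
        TARGET_PALETTE,
        key=lambda name: color_distance(source, hex_to_rgb(TARGET_PALETTE[name])),
    )


print(nearest_color("#FF5630"))  # an orange-red source color
print(nearest_color("#36B37E"))  # a light green source color
```

Plain Euclidean distance in RGB would also work for a sketch like this, but a weighted distance tends to pick neighbors that look closer to the human eye.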

How to run parallel WikiTraccs migrations?

You might want to run multiple WikiTraccs migrations in parallel to speed things up. Let’s take a look at how to do this and what to look out for.

By Heinrich Ulbricht | Thursday, June 06, 2024

You can run multiple WikiTraccs instances in parallel to speed up the overall migration. You’ll need a separate machine or VM per WikiTraccs instance as only one instance is allowed to run on the same machine.

Some definitions

The computer or virtual machine (VM) WikiTraccs is running on is the machine or migration machine.

We’ll call the running WikiTraccs program a WikiTraccs instance. When two WikiTraccs programs are running, that would be two instances.

The central SharePoint site that WikiTraccs needs and where the Space Inventory list (and more) is located is called WikiTraccs site.

Only one WikiTraccs instance is allowed to run per machine.

How to run multiple WikiTraccs instances in parallel?

Looking at clients’ projects I see different approaches.

Some create cloud-based virtual machines in Azure or AWS. Some use spare laptops.

Install WikiTraccs on each of those machines and you are good to go.

Configuring WikiTraccs when parallelizing the migration

There is local and shared configuration.

Configure locally on all machines

Make sure you configure the WikiTraccs settings on all machines. This affects things like comments migration, macro ignore list, migration mode etc. They can be different on different machines, but usually are the same.

Locally stored settings are:

everything you enter into the blue WikiTraccs.GUI window
settings available in the blue WikiTraccs.GUI window via the Settings menu
any settings configured via appsettings.json

Note that the Space Inventory that contains the source-to-target mapping is NOT a local setting, but stored in SharePoint, in the WikiTraccs site. But the link to the WikiTraccs site (that contains the Space Inventory) is part of the local settings.

Be aware of shared configuration via WikiTraccs site

One WikiTraccs site can be used by multiple WikiTraccs instances, or each instance can use their own WikiTraccs site.

With the same WikiTraccs site, multiple WikiTraccs instances share the same mappings.

There is one caveat to sharing one WikiTraccs site: all instances will also want to migrate the same content.

We’ll look at two scenarios now: one with a shared WikiTraccs site, one without.

Parallelization Scenario - Shared WikiTraccs Site

Multiple WikiTraccs instances share the same WikiTraccs site.

Two WikiTraccs instances sharing a single WikiTraccs site.

To achieve that, simply configure the same WikiTraccs site address for all WikiTraccs instances.

Pro:

you need to configure the mapping in the Space Inventory only once
you need to configure the user and group mapping only once
easy, as there is only one WikiTraccs site

Contra:

(note: before WikiTraccs v1.18) you need to change the WT_Setting_RequestTransformation value in the Space Inventory before starting each WikiTraccs instance as otherwise multiple instances migrate the same pages (!); one approach is to use PnP.PowerShell to manipulate the space inventory

Tip

Starting with WikiTraccs v1.18 migration waves are supported. Those simplify distributing the migration to multiple WikiTraccs instances.

Assign wave numbers to selectors in the Space Inventory. Then select a different wave for migration in each WikiTraccs instance.

Important

With this scenario, you need to make sure that no two WikiTraccs instances migrate the same content at any given time.
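When planning which space goes into which wave, it can help to balance the waves by page count so that all instances finish at roughly the same time. The following Python sketch shows one way to do that planning outside of WikiTraccs. The space keys and page counts are hypothetical, and the resulting wave numbers would still have to be entered into the Space Inventory yourself.

```python
import heapq


def assign_waves(page_counts: dict[str, int], wave_count: int) -> dict[str, int]:
    """Assign each space key a wave number (1..wave_count).

    Greedy balancing: the largest spaces are placed first, each into the
    wave that currently holds the fewest pages.
    """
    # Heap entries are (pages assigned so far, wave number)
    heap = [(0, wave) for wave in range(1, wave_count + 1)]
    heapq.heapify(heap)
    assignment: dict[str, int] = {}
    ordered = sorted(page_counts.items(), key=lambda item: (-item[1], item[0]))
    for space, pages in ordered:
        load, wave = heapq.heappop(heap)
        assignment[space] = wave
        heapq.heappush(heap, (load + pages, wave))
    return assignment


# Hypothetical spaces and page counts
spaces = {"HR": 1200, "IT": 8000, "DOCS": 3000, "SALES": 2500}
print(assign_waves(spaces, 2))
```

Because every space ends up in exactly one wave, and every instance migrates a different wave, no two instances work on the same content.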

Parallelization Scenario - Separate WikiTraccs Sites

Two WikiTraccs instances each using their own WikiTraccs site.

To achieve that, use a different WikiTraccs site address for each WikiTraccs instance.

Pro:

the WT_Setting_RequestTransformation value in the Space Inventory can be configured per WikiTraccs instance, as each instance has its own Space Inventory
the page snapshots that WikiTraccs stores in the Confluence Content Snapshots library of the WikiTraccs site are now spread across multiple WikiTraccs sites, which can be an advantage permission-wise (when the migration is done by different teams that are not allowed to see each other’s content)

Contra:

the mapping in the Space Inventory needs to be configured in all WikiTraccs sites, for all spaces and CQL selectors (!)
the user and group mapping needs to be configured in all WikiTraccs sites

Important

With this scenario, you need to make sure to configure each instance properly and to configure complete mappings in each Space Inventory (even for spaces migrated by other instances). It’s important for link transformation of cross-space links.
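A quick way to catch incomplete mappings is to compare the space keys mapped in each Space Inventory against the full list of spaces that are migrated overall. The Python sketch below assumes you have already collected those space keys per WikiTraccs site by some means (for example by exporting each list); the site names and keys are hypothetical.

```python
def missing_mappings(
    all_spaces: set[str], inventories: dict[str, set[str]]
) -> dict[str, set[str]]:
    """Return, per WikiTraccs site, the space keys that have no mapping yet."""
    report = {}
    for site, mapped in inventories.items():
        missing = all_spaces - mapped
        if missing:
            report[site] = missing
    return report


# Hypothetical data: space keys mapped in each site's Space Inventory
all_spaces = {"HR", "IT", "DOCS"}
inventories = {
    "wikitraccs-site-a": {"HR", "IT", "DOCS"},
    "wikitraccs-site-b": {"IT"},
}
print(missing_mappings(all_spaces, inventories))
```

An empty result means every Space Inventory knows the target of every space, which is what cross-space links need.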

General parallelization notes

Multiple instances can migrate to the same target SharePoint site, as long as they don’t migrate the same content at the same time to the same target.

Parallel migration is covered by the WikiTraccs license, without limits.

You shouldn’t go too crazy with the parallel instances. Microsoft limits how fast an application can access the Microsoft 365 services, and it will throttle WikiTraccs if it goes too fast. This means: WikiTraccs will have to wait some time (a couple of seconds, or even minutes) until it is again allowed to send content. Those limits are set by Microsoft, adjusted to overall cloud load, and dependent on the number of licenses in the tenant (among other factors).
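To make the waiting part more tangible: throttled requests to SharePoint Online are typically answered with HTTP status 429 or 503 and a Retry-After header that states how many seconds to wait. The Python sketch below shows how a client could derive the waiting time from such a response. It is a generic illustration, not the WikiTraccs implementation; the function name, the backoff values, and the assumption that header names arrive in exactly this spelling are mine.

```python
import random


def wait_seconds(
    status: int,
    headers: dict[str, str],
    attempt: int,
    base: float = 2.0,
    cap: float = 300.0,
) -> float:
    """Return how long to wait before retrying; 0.0 means no throttling occurred."""
    if status not in (429, 503):
        return 0.0
    retry_after = headers.get("Retry-After")
    if retry_after is not None and retry_after.strip().isdigit():
        # The server tells us exactly how long to back off
        return float(retry_after.strip())
    # No usable header: exponential backoff with jitter, limited by the cap
    delay = min(cap, base * (2 ** attempt))
    return delay * random.uniform(0.5, 1.0)


print(wait_seconds(200, {}, attempt=0))
print(wait_seconds(429, {"Retry-After": "30"}, attempt=0))
```

Honoring the server-provided value instead of retrying immediately matters, because requests sent while being throttled still count against the limits.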

Tip

You can use different SharePoint migration accounts for each of the parallel WikiTraccs instances; this can help increase the throttling limits that are imposed by Microsoft.

Fixing image positioning in SharePoint

How to cope with the new way SharePoint handles images that are placed next to each other.

By Heinrich Ulbricht | Sunday, October 29, 2023

Note

Read Broken inline image positioning in SharePoint to learn the background story of this post.

Note

Images that are part of a list are still broken when editing a page, even when applying the Page Companion Fix described in this post. Another support case with Microsoft is open regarding this page regression: 2311041420000292.

Summary of what happened to images in SharePoint

Up until about mid-2023 SharePoint text web parts were able to display inline images next to each other. Then Microsoft started upgrading the editor component that powers the text web part, while at the same time introducing changes to the overall page style. This broke adjacent image positioning.

Specifically, when editing a SharePoint page, its text web parts are auto-upgraded to the new editor version, which now positions once-adjacent images vertically, not horizontally:

So what are our options to fix this?

How to fix image positioning? And who?

The SharePoint text web part, at the time of this writing, does not support putting multiple images next to each other. Neither in the browser editor, nor via programmatic means.

Here’s a list of parties that could influence image positioning in text web parts, in principle:

Microsoft - but they say the current behavior is by design, so they won’t help
WikiTraccs
You

Number one is out, let’s look at the other two options.

What can WikiTraccs do?

Short version:

At the moment, nothing.

Longer version:

When migrating Confluence pages to SharePoint, WikiTraccs creates pages that still show adjacent images. That’s because it’s generating pages in a version that still supports this. SharePoint supports displaying pages that use the (now) older version.

As soon as you edit such a page it is upgraded to a newer version that doesn’t support adjacent images anymore.

The only thing that WikiTraccs could do is to start generating pages in such a way that images won’t be adjacent to begin with. You’d then see the non-adjacent version right away, without having to edit the page.

I’ll look into such options as soon as the new way to display pages is confirmed as generally available by Microsoft. I don’t have such a confirmation, yet.

That means that WikiTraccs cannot do anything at the moment.

What can you do?

You have two options:

Re-arrange images and content when you edit a page.
Roll back the layout change that Microsoft did to screw up image positioning.

Option 1 is straightforward: you edit a page, adjacent images are not next to each other anymore, you have to clean up.

Option 2 involves rolling back a single layout change that Microsoft introduces when upgrading the text web part.

The Wiki Transformation Project provides a tool to achieve option 2: the WikiTraccs Page Companion.

Introducing the WikiTraccs Page Companion

For option 2 to work, we need to change pages as they are displayed in the browser. It’s like duct taping a small piece of paper over a bad part of the page, using duct tape that can be removed without traces.

Fortunately Microsoft provides a supported means of integrating with SharePoint pages: SharePoint Framework (SPFx) Application Customizer. Using such a customizer, we can hook into each page as it is being displayed in the browser.

Regrettably, the change we need to make to how pages are being displayed is not officially supported by Microsoft. We need to accept that if we want to proceed with this duct tape option.

Note

This option does not modify page contents. It’s merely a temporary visual change. Disabling the application customizer restores out of the box behavior immediately.

Here’s a demo video of the WikiTraccs Page Companion in action. About 11 seconds into the video, note the little box reading Legacy image positioning (by WikiTraccs) in the lower right corner, that’s the extension:

(Note, because I get asked: the page tree you see in the video is WikiPakk, the SharePoint page tree experience.)

The toggle button can be used to toggle the image positioning fix on and off on a per-page basis. On means the duct tape is applied. Off means the duct tape is removed again. It is opt-in, so not enabled by default.

As an author, you would check the page you are editing, and toggle the button as needed. The fix affects the page both when being edited and being viewed.

The WikiTraccs Page Companion is available as pre-compiled download (to be deployed to a SharePoint app catalog), and also as source code for you to build yourself. It’s all on GitHub: WikiTraccs Page Companion.

Note that the non-standard nature of this option means that it might stop working when Microsoft makes changes to pages. If that happens, the necessary adjustments need to be re-evaluated.

I’m putting the source code out there, so that you are in control as well.

Get in touch if you have any questions.

Broken inline image positioning in SharePoint

A recent update to the text editor web part seems to break positioning for adjacent images.

By Heinrich Ulbricht | Saturday, October 07, 2023

Update October 22, 2023

Microsoft Support confirms that the behavior described on this page is expected. No adjacent images anymore.

Update October 29, 2023

Read the follow-up: Fixing image positioning in SharePoint

In general, SharePoint is behind Confluence when it comes to positioning images in relation to each other and to surrounding content.

But at least putting images next to each other is possible. Is? Was. Those two blog posts took a deep dive into the topic of images:

Unfortunately a recent update to the SharePoint text editor web part breaks adjacent images. They aren’t adjacent anymore when editing a page.

Microsoft is upgrading the editor version of SharePoint pages - and the trouble starts

The text editor web part is powered by the CKEditor editor component. It can be licensed by any third party to build rich editor experiences in the browser. Microsoft did that.

The following things define how the text editor web part behaves:

the features that CKEditor provides - like formatting highlighted text, adding tables and so on
add-ons to the editor that Microsoft develops
internal settings set by Microsoft defining which features are available to users
surrounding styles (CSS) that define how things look (e.g. font size)
page contents

The most recent version of CKEditor is version 5 (v5). So far SharePoint used version 4 (v4) of CKEditor.

Microsoft now upgrades pages to use CKEditor v5.

Why is this important for content creators? Because the content of SharePoint pages is somewhat coupled to the editor version of the text editor web part. Microsoft cannot just upgrade from CKEditor v4 to v5 without breaking old pages. Therefore Microsoft auto-upgrades pages when they are being edited. And that modifies the page content.

When you start editing a page and its content briefly disappears and then comes back - this might have been an auto-upgrade.

And somewhere along the way of this auto-upgrade the handling of inline images changed.

How does the page upgrade affect adjacent inline images?

Here are three visuals showing how pages used to behave, and how they behave now.

How it used to be…

This is a page with three adjacent images in view mode, using CKEditor v4:

This is the same page in edit mode, using CKEditor v4:

How it’s now…

And here is a video that shows what happens when the auto-upgrade to v5 for this page kicks in:

Note: I used the browser’s developer tools to slow down the internet connection, which allows us to watch everything in slow motion.

What can be seen in the above video:

the page enters edit mode and the three inline images can be seen next to each other for a brief moment
then all web part content vanishes; the auto-upgrade happens where SharePoint converts the content from CKEditor v4 to CKEditor v5
the web part content reappears; the images are placed on top of each other (NOT GOOD)
bonus: all images are gone! (this was a first when recording this video) (NOT GOOD)

Editing this page worked perfectly fine for at least the last 12 months. Images stayed in place, no problems.

Now broken.

What does this mean for pages that WikiTraccs migrated from Confluence to SharePoint?

Pages migrated by WikiTraccs use CKEditor v4. Adjacent images look fine when viewing those pages.

When editing a page with adjacent images SharePoint might decide to skip upgrading this page to v5. The page will look fine in view and edit mode.

When editing a page with adjacent images SharePoint might decide to upgrade this page to v5. Adjacent images won’t be adjacent anymore in edit mode and need to be re-arranged manually. This looks like a newly introduced technical limitation of SharePoint. Note that currently there is no way to put images next to each other via the browser editor.

Note: the same applies to pages transformed by other tools, e.g. when transforming classic SharePoint pages to modern SharePoint pages using the PnP Modernization Tooling.

Resources

I’m keeping track of topics related to page upgrades here.

Adjacent image positioning issues

Issues with adjacent image positioning were first reported by a WikiTraccs client in mid-September.

I created a Microsoft Tech Community post to get input: Modern pages with text web parts upgraded from CKEditor v4 to v5 lose image positioning.

To showcase and pinpoint the issue, I also created a minimal PowerShell script that can be used to create a SharePoint page with adjacent images to test the page upgrade: GitHub repository.

Furthermore I opened a case with Microsoft support, which is closed now. The case ID is 2310071420000151. According to them this is expected behavior.

Other known issues

There were issues with tables when upgrading pages from CKEditor v4 to v5:

related issue: https://github.com/SharePoint/sp-dev-docs/issues/9160
was forwarded to the product group

There are issues with the new spacing between paragraphs:

here’s a mighty thread about changes in paragraph spacings and the consequences: No paragraph spacing in text web part (Sharepoint modern pages)

Wrapping up

It’s not clear what exactly breaks image positioning on modern pages after upgrading the editor from v4 to v5.

It might be the content upgrade for those pages. But it might also be the changed surrounding styles that come with the upgrade.

I really hope that there will be a proper solution to putting images next to each other on a MODERN SharePoint page. Nothing more. Just some images. Next to each other.

How to migrate rich Confluence tables to limited SharePoint tables?

This blog post covers table migrations and how SharePoint differs from Confluence in this regard.

By Heinrich Ulbricht | Monday, July 03, 2023

Update

There is an update to the table story, and more options have been made available since this blog post was first published.

Read about it here: Making SharePoint Tables Look Pretty.

How do the table capabilities of Confluence and SharePoint differ?

The following table formatting options are available in Confluence:

setting the background color for table cells
having multiple header rows or columns
nesting tables
merging table cells

None of the above is possible in SharePoint at the time of this writing.

Let’s see how WikiTraccs works around that.

How does WikiTraccs adjust for table formatting limitations in SharePoint?

WikiTraccs has to get creative when it comes to mimicking the full range of Confluence table formatting options.

Table cell background color is transformed to colored emojis. The color is moved from the cell background to the cell content, as there are no colored table cells in SharePoint.

Confluence table with colored cells

Table after migrating to SharePoint, with color markers

Only a limited set of rectangular colored emojis is available to do this, so the color often will be off.

Multiple header rows or columns are represented as normal table rows and columns in SharePoint where the text content is formatted bold.

Confluence table with multiple header rows and columns

Table after migrating to SharePoint, with bold text simulating the headers

Nested tables are de-nested. This approach is analogous to how the Microsoft SharePoint Migration Tool does it for SharePoint on-prem to SharePoint Online migrations.

Merged table cells cannot be created in SharePoint Online. But WikiTraccs marks those cells that once were merged (as of WikiTraccs release v1.3.11).

Here is an example of how it looks after migrating a table with merged cells from Confluence to SharePoint:

Confluence table with merged table cells

Table after migrating to SharePoint, without marking former merged cells (note: default behavior before WikiTraccs v1.3.11)

Table after migrating to SharePoint, with marking former merged cells (note: default behavior as of WikiTraccs v1.3.11)

The little arrow in former merged cells points in the direction of the cell it had been merged with before having been migrated.
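The marking idea can be sketched in a few lines of Python. This is a simplified illustration under my own assumptions, not the WikiTraccs implementation: a table is given as rows of (text, colspan, rowspan) tuples, positions in the first row of a merged area get a left arrow, and positions in the rows below get an up arrow. The arrow characters are chosen for this example.

```python
def flatten_merged_cells(rows: list[list[tuple[str, int, int]]]) -> list[list[str]]:
    """Expand merged cells into a plain rectangular grid of strings.

    Each input cell is (text, colspan, rowspan). Positions that were covered
    by a merged cell receive an arrow pointing toward the cell with the content.
    Assumes the table has at least one cell.
    """
    grid: dict[tuple[int, int], str] = {}
    for r, row in enumerate(rows):
        c = 0
        for text, colspan, rowspan in row:
            # Skip positions already covered by a rowspan from a row above
            while (r, c) in grid:
                c += 1
            for dr in range(rowspan):
                for dc in range(colspan):
                    if dr == 0 and dc == 0:
                        grid[(r, c)] = text            # the content stays here
                    elif dr == 0:
                        grid[(r, c + dc)] = "←"        # merged to the left
                    else:
                        grid[(r + dr, c + dc)] = "↑"   # merged upward
            c += colspan
    height = max(r for r, _ in grid) + 1
    width = max(c for _, c in grid) + 1
    return [[grid.get((r, c), "") for c in range(width)] for r in range(height)]


table = [
    [("A", 2, 1), ("B", 1, 2)],
    [("C", 1, 1), ("D", 1, 1)],
]
for line in flatten_merged_cells(table):
    print(line)
```

In this example, "A" spans two columns and "B" spans two rows, so the flattened grid keeps both texts in place and fills the covered positions with markers.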

Marking former merged table cells is standard as of WikiTraccs v1.3.11.

Wrap-up

WikiTraccs needs to get creative when it comes to transforming Confluence tables to SharePoint tables. The results depend on the complexity of the tables that are migrated from Confluence to SharePoint and a manual content review does not hurt.

Let’s hope that SharePoint keeps maturing to a point where formatting tables is on a par with what’s possible in Confluence.

Migrating large Confluence spaces to SharePoint

This post is about migrating spaces with 10000+ pages which can be challenging due to long page retrieval times.

By Heinrich Ulbricht | Sunday, July 02, 2023

Update

For Confluence 7.18 and up the challenge described in the blog post has been solved.

How does WikiTraccs determine which pages to migrate for a space?

When WikiTraccs starts to migrate a space to SharePoint it retrieves the full list of pages for this space.

Say a space contains 20000 pages. WikiTraccs retrieves basic information about those 20000 pages and adds them to a migration queue. (Later, when migrating each page, WikiTraccs retrieves the contents of this page and creates the corresponding SharePoint page.)

Confluence doesn’t allow retrieving information of more than 200 pages at once. So to retrieve information about 20000 pages WikiTraccs needs to request 100 batches of 200 pages, as that is supported.
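The paging pattern itself looks roughly like the following Python sketch. It is an illustration, not WikiTraccs code. The actual request is hidden behind a fetch_batch function that you would have to supply; for Confluence Server this could be a call to the content REST endpoint with start and limit parameters, but which endpoint WikiTraccs uses is not covered here. The sample data is fake.

```python
from typing import Callable, Iterator


def iter_space_pages(
    fetch_batch: Callable[[int, int], list[dict]], limit: int = 200
) -> Iterator[dict]:
    """Yield all pages by requesting batches of `limit` until a short batch arrives."""
    start = 0
    while True:
        batch = fetch_batch(start, limit)
        yield from batch
        if len(batch) < limit:
            break  # a short (or empty) batch means there is nothing left
        start += limit


# Fake data source standing in for the Confluence API
FAKE_PAGES = [{"id": str(i)} for i in range(450)]
calls: list[int] = []


def fake_fetch(start: int, limit: int) -> list[dict]:
    calls.append(start)
    return FAKE_PAGES[start:start + limit]


pages = list(iter_space_pages(fake_fetch))
print(len(pages), "pages in", len(calls), "requests")
```

Note that this simple stop condition sends one extra request when the number of pages is an exact multiple of the batch size, because only an empty batch reveals that the end has been reached.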

This paged retrieval is where things get interesting for larger spaces.

Why can large Confluence spaces pose a challenge?

The more pages a space contains the longer it takes to retrieve a batch of pages.

Here are numbers from a Confluence 6 test migration of a space with about 23000 pages:

retrieving batch 1 takes < 1 second
retrieving batch 10 takes about 2 seconds
retrieving batch 25 takes about 4 seconds
retrieving batch 50 takes about 6 seconds
retrieving batch 75 takes about 10 seconds
retrieving batch 100 takes about 14 seconds

Assuming retrieving each batch of 200 pages takes a mean of 8 seconds, retrieving 20000 space pages would take 100*8=800 seconds, which is about 13 minutes.
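
The estimate can be reproduced with a few lines of Python. The first function mirrors the calculation above. The second is an alternative sketch that linearly interpolates between the measured batch times. The interpolation and the choice to treat "< 1 second" as 1.0 seconds are assumptions made for illustration only.

```python
def simple_estimate_seconds(page_count: int, batch_size: int, mean_seconds: float) -> float:
    """Number of batches (rounded up) multiplied by the mean time per batch."""
    batches = -(-page_count // batch_size)  # ceiling division
    return batches * mean_seconds


# (batch number, seconds) samples from the test migration above;
# "< 1 second" is treated as 1.0 here, which is an assumption
SAMPLES = [(1, 1.0), (10, 2.0), (25, 4.0), (50, 6.0), (75, 10.0), (100, 14.0)]


def interpolated_seconds(batch: int) -> float:
    """Linearly interpolate the retrieval time of one batch between samples."""
    for (b0, t0), (b1, t1) in zip(SAMPLES, SAMPLES[1:]):
        if b0 <= batch <= b1:
            return t0 + (t1 - t0) * (batch - b0) / (b1 - b0)
    raise ValueError("batch number outside the measured range")


def interpolated_total_seconds(batch_count: int = 100) -> float:
    """Sum the interpolated retrieval times of batches 1..batch_count."""
    return sum(interpolated_seconds(b) for b in range(1, batch_count + 1))


simple = simple_estimate_seconds(20000, 200, 8)
print(f"simple estimate: {simple} s = {simple / 60:.1f} min")
print(f"interpolated estimate: {interpolated_total_seconds() / 60:.1f} min")
```

Both approaches land in the same order of magnitude, which is all such an estimate can promise.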

Exporting this space (with empty dummy pages) via the Confluence space export function takes about 16 minutes:

Successful page migrations

Unfortunately there is not much that can be done about the time it takes to retrieve the list of pages. With release v1.6.8 WikiTraccs started to migrate pages while still retrieving the list of pages for a space. This at least allows the waiting times to be used to migrate the first pages.

Note

WikiTraccs logs the time it takes to retrieve a batch of pages, if it starts to take longer. Look out for those messages in the WikiTraccs.Console log or common log file:

Handled a batch of 200 pages for GIANT... (so far handled 18321)
Took 10.601631s to get space GIANT content (paged) from endpoint URL <...>

Confluence 7.18 introduced an API that allows for faster page retrieval, but so far there was no demand I am aware of as migrated environments were older. If you are migrating a Confluence version 7.18 or newer then please comment here to make this demand visible: User faster page retrieval API (Confluence 7.18 and up).

Conclusion

Large Confluence spaces (more than 10000 pages) can add significantly to the migration time.

When WikiTraccs starts migrating a space it retrieves a list of all space pages, which can take several minutes for those large spaces. The list of pages is retrieved multiple times, for example after migrating the space to check if all pages were migrated.

Those times can add up.

So it’s good to know the spaces with large page numbers beforehand, or to learn about them during a test migration.

Ignoring macros when migrating pages from Confluence to SharePoint

This post describes the new macro ignore list feature of WikiTraccs.

By Heinrich Ulbricht | Friday, May 19, 2023

Note

The Macro Ignore List is available as of WikiTraccs v1.2.0.

Why do I need a macro ignore list?

Certain Confluence macros are only visible when editing a wiki page. Often WikiTraccs can see those macros and migrates them to the SharePoint page - where they will be visible for visitors as well.

One example is the Excerpt macro with the hidden option set.

WikiTraccs offers a macro ignore list that can be used to exclude such macros from the migration.

How to use the macro ignore list?

This series of screenshots shows how to identify macros to ignore and how to configure the ignore list.

Here’s a Confluence page with one visible macro, as shown for readers:

The Confluence page in edit mode reveals the second hidden Excerpt macro:

Without a configured Macro Ignore List WikiTraccs will migrate the hidden macro to SharePoint, making it visible. Here’s a screenshot of such a SharePoint page:

To solve this, open the Confluence page, choose View Storage Format for the page and copy the offending macro XML:

In WikiTraccs.GUI, in the menu bar, choose Settings > Configure Transformation. Paste the macro XML into the Macro Ignore List:

Note that in the above screenshot the macro XML was formatted before pasting it into the Macro Ignore List. This can be done with text editors like Visual Studio Code or Notepad++ to improve readability, but it is optional and not needed for the Macro Ignore List to work.

In the Macro Ignore List, remove every macro parameter except hidden:

Delete previously migrated pages as needed after changing the Macro Ignore List, and start the migration.

The macro is now removed by WikiTraccs. Here’s the migrated page again as it looks in SharePoint:

Any macro can be added to the Macro Ignore List. Macros matching the Macro Ignore List will be removed from all pages in the Confluence to SharePoint migration.

During migration, WikiTraccs looks at the macro name and its parameters. If it sees a macro that has the same name and all of the parameters and parameter values of a macro in the Macro Ignore List, it skips this macro and doesn’t add it to the SharePoint page.
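
The matching rule can be sketched like this. It is a simplified illustration of my reading of the rule, not WikiTraccs' actual code: macros are modeled as a name plus a parameter dictionary instead of storage format XML, and the names Macro, matches_template and is_ignored are hypothetical. Additional parameters on the page macro are assumed not to prevent a match, which fits the step of removing every parameter except hidden from the template.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Macro:
    """Simplified stand-in for a Confluence macro (hypothetical model)."""
    name: str
    parameters: Dict[str, str] = field(default_factory=dict)


def matches_template(macro: Macro, template: Macro) -> bool:
    """True if the macro has the template's name and all of its parameter values."""
    if macro.name != template.name:
        return False
    return all(
        macro.parameters.get(key) == value
        for key, value in template.parameters.items()
    )


def is_ignored(macro: Macro, ignore_list: List[Macro]) -> bool:
    """True if any template in the ignore list matches the macro."""
    return any(matches_template(macro, template) for template in ignore_list)


# Template as in the example above: the Excerpt macro with only "hidden" left
ignore_list = [Macro("excerpt", {"hidden": "true"})]

hidden_excerpt = Macro("excerpt", {"hidden": "true", "atlassian-macro-output-type": "BLOCK"})
visible_excerpt = Macro("excerpt")

print(is_ignored(hidden_excerpt, ignore_list))   # True
print(is_ignored(visible_excerpt, ignore_list))  # False
```

Keeping the template minimal makes it match more macros, while adding parameters narrows the match.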

Note

You can add multiple macro XML templates to the Macro Ignore List; just put them one after another.

How to monitor if the filter works as expected?

WikiTraccs logs removed macros while migrating pages from Confluence to SharePoint.

Search for the phrase Removed macro via config by matching template in the WikiTraccs log file to see which macros were removed and which template from the Macro Ignore List matched.

In addition, the Site Pages column WT: Transformation Log contains this information as well.
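
Searching the log file for that phrase can also be scripted. The following is a small sketch that only relies on the phrase itself. The rest of the log line format is not assumed, and the file path in the comment is a placeholder.

```python
from typing import Iterable, List

PHRASE = "Removed macro via config by matching template"


def find_removed_macro_lines(log_lines: Iterable[str]) -> List[str]:
    """Return all log lines that report a macro removed via the ignore list."""
    return [line.rstrip("\n") for line in log_lines if PHRASE in line]


# Usage with a real log file (the path is a placeholder):
#
# with open("path/to/wikitraccs.log", encoding="utf-8") as log_file:
#     for line in find_removed_macro_lines(log_file):
#         print(line)
```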

Wrap-up

The Macro Ignore List lets you specify which macros should be skipped in any Confluence to SharePoint page transformation.

It is one feature that was born directly from customer feedback. And it’s another step in the direction of a more configurable transformation process.

Get your migration started today!

Give WikiTraccs a try and check out its transformation capabilities!

Quick start guide >>

Start today with WikiTraccs’ free Trial Version:

Download FREE WikiTraccs now

Or get in touch via email if you are interested in a demo. Give it 45 minutes and you’ll be up to speed on how WikiTraccs can help you.

Registering WikiTraccs as app in Azure AD

By Heinrich Ulbricht | Wednesday, April 12, 2023

Azure AD is now Entra ID - please refer to the updated documentation: Registering WikiTraccs as app in Entra ID.

Registering WikiTraccs as app in Entra ID

This post shows how to get that Client ID needed by WikiTraccs.GUI.

By Heinrich Ulbricht | Wednesday, April 12, 2023

I recently got the following inquiry about WikiTraccs:

Please explain again briefly what the Tenant ID and Azure Client IDs are and how to acquire them.

This request is not the first of its kind and totally understandable. It refers to the following configuration in WikiTraccs.GUI:

Unless you are a Microsoft 365 developer or admin you normally never have to care about those IDs.

Why do I have to enter Tenant ID and Client ID in WikiTraccs? Where do those come from?

When WikiTraccs creates your migrated pages in SharePoint it needs to access APIs provided by Microsoft. Those are the SharePoint API and the Microsoft Graph API.

Accessing SharePoint in any form requires authentication. Open <company>.sharepoint.com (replace ‘<company>’ with the actual value of your SharePoint) in a private browser tab and it will ask for your credentials like email address and password.

Authentication is also required when WikiTraccs starts the migration. It will open a browser window for you to sign in.

Migration Account

It’s recommended to create a dedicated migration account. This account will be granted permissions to migration target sites only for the duration of the migration.

Besides users, applications like WikiTraccs have to authenticate as well to access SharePoint.

To allow an application to authenticate, it has to be registered in Entra ID (formerly known as Azure Active Directory, or Azure AD). This is a manual configuration that needs to be done once.

A registered application has an ID - the Client ID. And the Entra ID tenant it is created in also has an ID - the Tenant ID. Those are the IDs needed by WikiTraccs.GUI.

How to register WikiTraccs as Application in Entra ID?

An Entra ID tenant administrator has to participate in the following steps.

In a browser, navigate to https://entra.microsoft.com.

The Microsoft Entra admin center (formerly the Azure Active Directory portal) opens.

In the left menu, under Applications, select App registrations, then New registration:

The Register an application blade opens:

Configure the following:

under Name enter WikiTraccs Migration Tool (note: or any other name of your choosing)
under Supported account types select Accounts in this organizational directory only
under Redirect URI (optional) choose Public client/native (mobile & desktop) and type http://localhost in the edit field

Select Register to confirm the app registration.

The application’s settings are now open:

In the left menu, under Manage, choose API permissions.

Choose Add a permission to add the following permissions:

Microsoft Graph > Sites.FullControl.All (delegated)
SharePoint > AllSites.FullControl (delegated)

Make sure to select the Delegated permissions (not application permissions). Here’s a sample screenshot:

The configured permissions need to be consented to by the tenant admin.

As a tenant admin, choose Grant admin consent for… and confirm:

In the left menu choose Overview and make note of both the Tenant ID (aka Directory ID) and Client ID (aka Application ID):

Those IDs need to be entered in WikiTraccs.GUI as Tenant ID and Azure AD Application Client ID.

What did I just configure? Can WikiTraccs now access all content in SharePoint?

NO, WikiTraccs can NOT access all content in SharePoint.

Let me explain.

When starting a migration with WikiTraccs you have to log in with a Microsoft 365 account.

Now the permissions WikiTraccs gets are the intersection of two things:

the delegated permissions you configured above (that is Sites.FullControl.All, AllSites.FullControl) AND
the permissions of the account you logged in with

This is the magic behind delegated permissions. WikiTraccs can only access as much of SharePoint as the logged-in account.

If you log in with an account that is owner in all sites - sure, WikiTraccs could now access those sites.

But if you log in with a dedicated migration account that only has access to some sites - WikiTraccs can now only access those, nothing more.
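
The intersection can be illustrated with a deliberately simplified model. The operation names and the function effective_access are hypothetical and only serve to show the set logic. Real SharePoint permissions are more fine-grained.

```python
from typing import Dict, Set

# Hypothetical operation sets standing in for what the delegated scopes allow
FULL_CONTROL: Set[str] = {"read", "write", "manage", "full_control"}
MANAGE: Set[str] = {"read", "write", "manage"}


def effective_access(
    app_operations: Set[str],
    user_operations_by_site: Dict[str, Set[str]],
) -> Dict[str, Set[str]]:
    """Per site, intersect what the app may do with what the signed-in user may do."""
    result = {}
    for site, user_operations in user_operations_by_site.items():
        allowed = user_operations & app_operations
        if allowed:
            result[site] = allowed
    return result


# A dedicated migration account with access to one target site only
migration_account = {
    "/sites/migration-target": FULL_CONTROL,
    "/sites/hr-confidential": set(),  # the account has no access here
}

print(effective_access(FULL_CONTROL, migration_account))
```

Even with FullControl scopes configured for the app, the confidential site does not show up in the result because the account itself has no access to it.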

Are there alternatives to those permissions?

I hear from customers who have trouble getting the tenant admin to consent to the permissions.

Use fewer permissions, get fewer things migrated

If you can’t get admin consent for FullControl permissions you can try using the following Manage permissions instead:

Microsoft Graph > Sites.Manage.All (delegated)
SharePoint > AllSites.Manage (delegated)

Those allow a content migration as well.

But the functionality of WikiTraccs will be limited somewhat:

page permissions cannot be configured, as WikiTraccs won’t be allowed to do so
out-of-the-box SharePoint page and file metadata Created By, Created (Date), Modified By, Modified (Date) cannot be set, as this requires the same permissions as configuring permissions

Note that an admin can also grant admin consent to those permissions.

Use an existing application

If you have access to another Entra ID application that has the required permissions configured you can use the Client ID of this existing app.

That’s why there is the Use M365 PnP Client ID button in WikiTraccs.GUI. It enters the well-known ID of the application used by PnP.PowerShell. This only works if you have access to the application and the needed permissions are configured for it (this is not always the case!). Note: The multitenant PnP.PowerShell enterprise application has been retired by Microsoft.

Troubleshooting

Check the platform

There is one common oversight when configuring the Entra ID application: selecting the right platform.

You might see the following error when testing the connection to SharePoint or starting the migration if the platform is not correct:

“A configuration issue is preventing authentication - check the error message from the server for details. You can modify the configuration in the application registration portal. See https://aka.ms/msal-net-invalid-client for details. Original exception: AADSTS7000218: The request body must contain the following parameter: ‘client_assertion’ or ‘client_secret’.”

This error happens when the platform of the Entra ID application has been set to Web instead of Public client/native (mobile & desktop).

Change the platform to Public client/native (mobile & desktop), which then shows up as Mobile and desktop applications here:

The error should now be gone.

Check the redirect address after authenticating

After logging in to Microsoft 365 via the browser, a redirect to http://localhost happens in the browser address bar.

This has to look like this:

Normally WikiTraccs takes over from this point. Get in touch if there are issues.

Test connecting with PnP.PowerShell

You can use PnP.PowerShell to check if an authentication issue is related to your Entra ID application configuration, or related to WikiTraccs doing something wrong.

Start PowerShell 7 and install the PnP.PowerShell module.

Configure and run the following script:

# make sure to use PowerShell 7 and install the PnP.PowerShell module; use the following command for that
# Install-Module PnP.PowerShell -Scope CurrentUser

# enter your SharePoint site to connect to here (the same as in WikiTraccs, either WikiTraccs site or default target site):
$siteUrl = "https://contoso.sharepoint.com/sites/somesite"
# enter your Entra ID application client id (application id) here (the same as in WikiTraccs):
$clientId = "017d043a-df74-4ab0-a411-430104faeaa4"

# authenticate with app, as user, and open login dialog in new browser window
Connect-PnPOnline -Url $siteUrl -ClientId $clientId -Interactive -LaunchBrowser:$true
# this should print the title of the site we connected to, if the connection was successful
Get-PnPWeb

This PowerShell script should open the Microsoft 365 login experience in the browser, just like WikiTraccs does. After logging in, it should print the title of the site you connected to. Otherwise, there will be an error that might help diagnose the app configuration issue.

Check that the migration user has access to the target SharePoint sites

If the migration user has no access to the SharePoint site, the following error might be shown:

Error while testing connection to target site...
HttpResponseCode: 403
Code: System.UnauthorizedAccessException
Message: Attempted to perform an unauthorized operation

Make sure the migration user account is site collection administrator of all target sites and the WikiTraccs site.

Check that the migration user is site collection administrator of the target SharePoint sites

If the migration user has only limited access to the SharePoint site, the following error might be shown:

Successfully connected to target site '2024-11-05-permission-test-deleteme2' but permissions are missing - refer to the docs for details 
(view list items: YES, view pages: YES, allowed to approve items: NO, manage lists: NO, add and customize pages: NO, manage the site: NO).

Please give permissions, site admin works well. See log for details. Cannot continue for now.

This happens if the migration user is only visitor, member, or contributor in the site. Those limited permission levels don’t allow WikiTraccs to properly migrate pages.

Make sure the migration user account is site collection administrator of all target sites and the WikiTraccs site.

Check the log files for details

Note: this log output is available as of WikiTraccs v1.20.40.

WikiTraccs logs information about both the Entra ID authentication result and the SharePoint site access to the common log files.

When authenticating with Entra ID, WikiTraccs receives an access token. The claims of this access token can be seen in the log files:

[ 15:59:14 INF] #1 === START Claims of Access Token === | 
[ 15:59:14 INF] #1 aud=00000003-0000-0ff1-ce00-000000000000 | 
[ 15:59:14 INF] #1 amr=pwd | 
[ 15:59:14 INF] #1 amr=mfa | 
[ 15:59:14 INF] #1 app_displayname=Confluence Migration | 
[ 15:59:14 INF] #1 appid=abcae113-5a68-4df9-b0f9-70243943beef | 
[ 15:59:14 INF] #1 family_name=Vance | 
[ 15:59:14 INF] #1 given_name=Adele | 
...
[ 15:59:14 INF] #1 scp=AllSites.FullControl Sites.FullControl.All | 
[ 15:59:14 INF] #1 tid=1acdc182-0353-4ee1-b9fa-225ce104406b | 
[ 15:59:14 INF] #1 === END Claims of Access Token === |

The above claims show that user Adele Vance authenticated with app Confluence Migration, using interactive login, being granted delegated permissions AllSites.FullControl and Sites.FullControl.All. You can use that to double-check your configuration.
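
If you want to check the granted scopes programmatically, the claim lines can be parsed with a short script. This is a sketch that assumes the log lines look exactly like the excerpt above. The function names parse_claims and granted_scopes are made up for this example.

```python
import re
from collections import defaultdict
from typing import Dict, Iterable, List, Set

# Matches lines like: [ 15:59:14 INF] #1 scp=AllSites.FullControl Sites.FullControl.All |
CLAIM_LINE = re.compile(r"#\d+\s+(?P<key>[A-Za-z_]+)=(?P<value>.*?)\s*\|\s*$")


def parse_claims(log_lines: Iterable[str]) -> Dict[str, List[str]]:
    """Collect claims from log lines; a claim like amr can occur several times."""
    claims = defaultdict(list)
    for line in log_lines:
        match = CLAIM_LINE.search(line)
        if match:
            claims[match.group("key")].append(match.group("value"))
    return dict(claims)


def granted_scopes(claims: Dict[str, List[str]]) -> Set[str]:
    """Split the space-separated scp claim into individual permission scopes."""
    scopes: Set[str] = set()
    for value in claims.get("scp", []):
        scopes.update(value.split())
    return scopes


log_excerpt = [
    "[ 15:59:14 INF] #1 === START Claims of Access Token === | ",
    "[ 15:59:14 INF] #1 amr=pwd | ",
    "[ 15:59:14 INF] #1 amr=mfa | ",
    "[ 15:59:14 INF] #1 scp=AllSites.FullControl Sites.FullControl.All | ",
    "[ 15:59:14 INF] #1 === END Claims of Access Token === |",
]

scopes = granted_scopes(parse_claims(log_excerpt))
print("AllSites.FullControl" in scopes)
```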

Then, when accessing SharePoint, WikiTraccs logs information about the migration user’s access to the SharePoint site:

[ 15:59:14 INF] Logging SharePoint site 'Test Site' (/sites/test-site) permission details (for diagnosing access-related issues): | 
[ 15:59:14 INF] PermissionKind."ViewListItems": true | 
[ 15:59:14 INF] PermissionKind."AddListItems": false | 
[ 15:59:14 INF] PermissionKind."EditListItems": false | 
[ 15:59:14 INF] PermissionKind."ManageLists": false | 
...
[ 15:59:14 INF] PermissionKind."ManageWeb": false | 
[ 15:59:14 INF] PermissionKind."FullMask": false |

Here, the migration user appears to have read-only access because they are allowed to view list items, but are not allowed to add list items, or manage lists.

Make sure the migration user account is site collection administrator of all target sites and the WikiTraccs site.
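
The permission lines can be evaluated in a similar way. The sketch below assumes the log format shown in the excerpt above. parse_permissions and missing_permissions are hypothetical helper names.

```python
import re
from typing import Dict, Iterable, List

# Matches lines like: [ 15:59:14 INF] PermissionKind."ViewListItems": true |
PERMISSION_LINE = re.compile(r'PermissionKind\."(?P<name>\w+)":\s*(?P<granted>true|false)')


def parse_permissions(log_lines: Iterable[str]) -> Dict[str, bool]:
    """Map each logged permission kind to whether it is granted."""
    permissions = {}
    for line in log_lines:
        match = PERMISSION_LINE.search(line)
        if match:
            permissions[match.group("name")] = match.group("granted") == "true"
    return permissions


def missing_permissions(permissions: Dict[str, bool]) -> List[str]:
    """Return the names of all permission kinds that are not granted, sorted."""
    return sorted(name for name, granted in permissions.items() if not granted)


log_excerpt = [
    '[ 15:59:14 INF] PermissionKind."ViewListItems": true | ',
    '[ 15:59:14 INF] PermissionKind."AddListItems": false | ',
    '[ 15:59:14 INF] PermissionKind."ManageLists": false | ',
]

print(missing_permissions(parse_permissions(log_excerpt)))
```

A non-empty result for a site collection administrator would be unexpected and worth a closer look.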

Incremental consent might be required if, after successful authentication, WikiTraccs shows (and logs) messages that look like these:

No permission granted to SharePoint, whatsoever.

Some permission to SharePoint has been granted via other permission scopes, but not enough to even read content.

WikiTraccs asks for the special .default permission scope when the user authenticates with Entra ID. The resulting access token contains all permission scopes that have been consented to by an admin on behalf of the organization (note: this is required for all FullControl scopes), and scopes that have already been consented to by the migration account.

The latter will cause issues when working with Sites.Manage.All and AllSites.Manage scopes that don’t require admin consent. Those scopes might be missing from the access token. Refer to the log files to check the presence of those scopes.

Specifically, when Sites.Manage.All or AllSites.Manage are missing in WikiTraccs’ access token, you have to invoke the consent dialog for those scopes for the migration account with an individual consent URL that looks like this:

https://login.microsoftonline.com/00000000-0000-0000-0000-000000000000/oauth2/v2.0/authorize
?client_id=11111111-0000-0000-0000-000000000000
&response_type=code
&redirect_uri=http://localhost
&response_mode=query
&prompt=select_account
&scope=https://graph.microsoft.com/Sites.Manage.All+https://contoso.sharepoint.com/AllSites.Manage

Modify the above URL template as follows:

Instead of 00000000-0000-0000-0000-000000000000, enter your SharePoint Tenant ID (note: you can see the tenant ID in the blue WikiTraccs window)
Instead of 11111111-0000-0000-0000-000000000000, enter the client ID of your Entra ID application
In https://contoso.sharepoint.com/AllSites.Manage replace contoso.sharepoint.com with your SharePoint host

So, the URL might look like this:

https://login.microsoftonline.com/6dbba70a-8629-4c31-9991-e2cbcd6f6d90/oauth2/v2.0/authorize?client_id=1df1b835-b821-413b-9c42-c67313e84c35&response_type=code&redirect_uri=http://localhost&response_mode=query&prompt=select_account&scope=https://graph.microsoft.com/Sites.Manage.All+https://adventureworks.sharepoint.com/AllSites.Manage
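
The substitutions can also be done with a small helper. This sketch simply fills the three values into the URL template from above. The function name build_consent_url is made up, and the IDs in the example call are the sample values from this post, not real ones.

```python
def build_consent_url(tenant_id: str, client_id: str, sharepoint_host: str) -> str:
    """Fill tenant ID, client ID and SharePoint host into the consent URL template."""
    scope = (
        "https://graph.microsoft.com/Sites.Manage.All"
        f"+https://{sharepoint_host}/AllSites.Manage"
    )
    return (
        f"https://login.microsoftonline.com/{tenant_id}/oauth2/v2.0/authorize"
        f"?client_id={client_id}"
        "&response_type=code"
        "&redirect_uri=http://localhost"
        "&response_mode=query"
        "&prompt=select_account"
        f"&scope={scope}"
    )


print(
    build_consent_url(
        tenant_id="6dbba70a-8629-4c31-9991-e2cbcd6f6d90",
        client_id="1df1b835-b821-413b-9c42-c67313e84c35",
        sharepoint_host="adventureworks.sharepoint.com",
    )
)
```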

Open a browser where the migration account is authenticated to SharePoint, paste the final URL into the browser address bar and press the Return key to navigate to this address. Follow the login flow.

Eventually, a consent dialog like the following should open in the browser:

Click Accept to give consent. This unblocks the migration.

Repeat the connection test in WikiTraccs. The error message should change, telling you that you don’t have full control access, but that migration should still work. Also, when starting the migration, it should now create pages in SharePoint.

Wrap-up

WikiTraccs needs an Entra ID application registration to access SharePoint. This is true for all applications integrating with Microsoft 365 services.

A common challenge is getting the right people to consent to the configuration. The purpose of the app needs to be communicated clearly.

Technically, registering the app is quickly done following the steps in this post.

And last but not least, the migration user needs proper access to all target SharePoint sites.

The art of positioning images - part 2 of 2

How are images handled when migrating pages? In this blog post we look at actual migration results.

By Heinrich Ulbricht | Tuesday, March 21, 2023

Update October 22, 2023: Microsoft broke it

Mid-2023 Microsoft started upgrading the editor behind the text web part and broke adjacent images in the process. Read more here: Broken inline image positioning in SharePoint.

This blog post is the second of a two-parter about images in the context of a Confluence to SharePoint migration:

Part One: Why is image positioning important and which options do I have in Confluence and SharePoint?
Part Two: What do migrated images look like in SharePoint, with the focus on retaining positioning? (this post)

In the last post we looked at image positioning options in Confluence and SharePoint. How can images be formatted in Confluence? How in SharePoint?

Now we look at actual migration results.

The easy ones: images in their own paragraph

An image in a separate paragraph:

Confluence	SharePoint

Apart from SharePoint taking more space in general, the layout looks comparable.

Now a centered, floating image:

Confluence	SharePoint

SharePoint can do that, good.

Off to something SharePoint cannot do well.

The problematic ones: images as “characters” in the text flow

Here is something that SharePoint cannot do: putting an image at the end of a line:

Confluence	SharePoint

How about putting text to the right of an image:

Confluence	SharePoint

This looks better, although WikiTraccs has to cheat here to make it look like this. This image is converted to a floating one, with the text flowing to the right. More on floating images in the next section.

By the way, putting images next to each other for Confluence is like putting characters next to each other. WikiTraccs lets those images float as well, which can look pretty decent in SharePoint:

Confluence	SharePoint

And something that is just not possible in SharePoint: putting an image right inside the text, like a “giant character”:

Confluence	SharePoint

Now we have a look at floating images.

The compatible ones: floating images

Text can flow around floating images on the left or on the right side.

Here is a left-floated image, with text flowing around it on the right side:

Confluence	SharePoint

Note

The layout in SharePoint will always differ because the font size and spacing is different. This is especially relevant when things in Confluence have been positioned by pressing the Return key multiple times. Those empty lines will look different in SharePoint.

And here is a right-floated image, with text flowing around it on the left side:

Confluence	SharePoint

A special case…

A common pattern that I could observe on Confluence pages is to put some explanatory text next to an image like so:

Confluence	SharePoint

There is a trick WikiTraccs needs to play to make this work at all.

WikiTraccs inserted a one-pixel “floatbreaker” image to wrap the text at the right spot, after the sentence “One lone sentence next to the image”.

Here you see the SharePoint page in edit mode, on the left with floatbreaker (selected in the editor), on the right after deleting it.

SharePoint with “floatbreaker”	SharePoint without “floatbreaker”

The floatbreaker is another tiny, invisible image in the SharePoint page with positioning set to center. That breaks the text flow where it’s needed. And it’s the only way I found to stop the other text from flowing around as well.

Kind of hacky, but it works and it survives entering the page edit mode. Of course page authors are free to remove the dummy image and change the formatting.

Note

There is a setting that can be used to prevent the floatbreaker image from being added during migration: EnableDummyImageFloatResetForImages. The default is on.

Wrap-up

Migrating images from Confluence to SharePoint is no easy task for WikiTraccs. Confluence provides more formatting options than SharePoint.

And don’t mention Confluence pages that were created by importing from Word files! Somebody will start crying.

But WikiTraccs will make sense of the most common cases it finds. This should make those image-laden instruction pages readable in SharePoint as well.

Note

If you find something that is off please open a ticket and I’ll have a look. Please add the (anonymized) page in storage format XML so I can see what’s going on there.

The art of positioning images - part 1 of 2

How are images handled when migrating pages? In this blog post we look at what is possible in Confluence and SharePoint.

By Heinrich Ulbricht | Sunday, March 12, 2023

This blog post is the first of a two-parter about images in the context of a Confluence to SharePoint migration:

Part One: Why is image positioning important and which options do I have in Confluence and SharePoint? (this post)
Part Two: What do migrated images look like in SharePoint, with the focus on retaining positioning?

Off we go.

Why is image positioning important?

Hey ChatGPT, why is image positioning important in a Confluence wiki page?

ChatGPT says...

Image positioning is important in a Confluence wiki page for several reasons:

Enhancing readability: Proper image positioning can make the page more visually appealing and help readers quickly locate relevant information. If an image is placed in a way that disrupts the flow of text or makes it difficult to read, it can detract from the overall user experience.
Conveying information: Images are often used to convey information that is difficult to explain in words alone. Proper positioning can ensure that the image is placed near the relevant text and is large enough to be clearly visible.
Aesthetics: A well-designed page with carefully placed images can make the information more visually appealing and engaging for the reader. It can also help to break up large blocks of text and make the page more visually interesting.
Accessibility: Proper image positioning is also important for ensuring accessibility. Images should be positioned in a way that makes them easy to view for all users, including those with disabilities. This may involve adding appropriate alt text, captions, or descriptions to images, and ensuring that they are positioned in a way that is easy to view using assistive technology.

Overall, proper image positioning is an important aspect of creating a well-designed and effective Confluence wiki page. It can help to enhance readability, convey information, improve aesthetics, and ensure accessibility for all users.

Sounds about right.

Note

The next release of WikiTraccs contains lots of image-related improvements. Stay tuned.

Let’s look at what Confluence has up its sleeve for images.

How can images be positioned in Confluence?

Images as “giant characters”

You can upload images and add them anywhere on a wiki page.

When inserting an image into text it behaves like one big character:

The image cuts the word in half and sits there as if it were text, while taking its space.

Spoiler alert: SharePoint cannot do this. This will become relevant when looking at how WikiTraccs migrates such images to SharePoint.

Here is another sample, looking at it from different angles.

First there is the image, without text around it.

Then a line break on the right edge is removed, hoisting text up next to the image.

Then the image itself is hoisted up, now having text on its left edge.

And then the image is again put right into the text, and when making it smaller it starts to fit in.

This mode of positioning images works especially well for putting multiple images next to each other:

Now let’s look at another image positioning option.

Floating Images

There is a different way of positioning Confluence images known as floating.

Text can flow or wrap around such images.

Here is a sample on how to do this in Confluence Server 8:

Note

The wrap behavior and the buttons to use might differ between Confluence versions, and in Confluence Cloud.

And that’s it. Now we look at SharePoint.

How can images be positioned in SharePoint Online?

SharePoint only supports the second way of positioning images - floating them.

Here’s a demo of how inserting an image into text works in SharePoint:

The text is split into two parts above and below the image, the image is centered.

Selecting the image shows buttons that control image positioning:

This behavior resembles that of floating images in Confluence.

Content is either wrapping on the left or right of the image, or the image is centered with text above and below it.

Now I tried to put two images next to each other.

I tried copy and paste. I tried to insert multiple at once. I tried moving them around.

Here’s me failing to position two images next to each other:

And that sums up the state of inline images in SharePoint pretty well. I was not able to put images next to each other.

Luckily WikiTraccs can control the content on a finer level than I can with the mouse, while still creating content that adheres to the SharePoint standard. More on that in the next post.

Confluence Page Tree in SharePoint

This post demos hierarchical page navigation in SharePoint.

By Heinrich Ulbricht | Sunday, March 05, 2023

🌳 UPDATE - WIKIPAKK IS OUT 🌳

The WikiPakk breadcrumb and page tree experience for SharePoint is available from Microsoft AppSource!

Get it from the Microsoft app store, test for free: App Source.

Learn more on the web site: WikiPakk.

No need to build a dynamic SharePoint page tree from scratch. It has been done. Use WikiPakk.

How to create a dynamic page tree in SharePoint?

That is a question I hear time and time again.

Pages in SharePoint have no hierarchy (in contrast to Confluence). All modern pages are basically stored at the same level in the Site Pages Library.

Ever since I started working with SharePoint over 10 years ago, I have helped customers build trees of different sorts:

Search-driven page trees
Static page trees (deeper than the standard allows)
Taxonomy-driven page trees
Metadata-driven page trees

There is still no ready-made solution we can take and use to make navigating migrated pages easier. At least none that is free. (If you know one get in touch and I’ll update the post!)

Let’s effin build a page tree!

You’ll probably need a page tree in SharePoint if you are using WikiTraccs to migrate pages from Confluence. Users might be lost without it, depending on the number of pages.

Here is a demo of how such a page tree can look in SharePoint:

What are we seeing here?

The animation starts with a view of the Site Pages library. WikiTraccs was used to migrate the shown pages from Confluence to SharePoint.

Then a page is clicked, opening this page.

Above the page you can see a new breadcrumb navigation and a page tree icon.

Clicking the page tree icon opens a panel on the right side that shows a dynamic page tree. The current page is highlighted. Any other page can be opened by selecting it in the tree.

Note: the page trees that are shown as part of the page content are static views of the Confluence page tree macro. This could be the next step: replacing those static placeholders with dynamic web parts. But that is something for the future.

How was it done?

The breadcrumb and page tree above are metadata-driven.

When migrating pages to SharePoint WikiTraccs takes care of migrating page tree metadata as well:

Page ID
Parent page ID
Page order

Every migrated page carries this information in the Site Pages library.
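
To make the idea concrete, here is a minimal sketch in Python of how a breadcrumb and a page tree can be derived from these three values. This is not the code of the SPFx solution. The field names (id, parent_id, order, title) and the sample pages are made up for illustration and are not the actual column names in the Site Pages library.

```python
from collections import defaultdict

# Hypothetical page records. The keys are illustrative only and do not
# reflect the real column names that WikiTraccs writes to Site Pages.
pages = [
    {"id": 1, "parent_id": None, "order": 0, "title": "Home"},
    {"id": 2, "parent_id": 1, "order": 1, "title": "Guides"},
    {"id": 3, "parent_id": 1, "order": 0, "title": "About"},
    {"id": 4, "parent_id": 2, "order": 0, "title": "Setup"},
]


def build_children_index(pages):
    """Group pages by parent ID and sort siblings by page order."""
    children = defaultdict(list)
    for page in pages:
        children[page["parent_id"]].append(page)
    for siblings in children.values():
        siblings.sort(key=lambda p: p["order"])
    return children


def breadcrumb(pages, page_id):
    """Walk up the parent chain and return titles from root to page."""
    by_id = {p["id"]: p for p in pages}
    trail, seen = [], set()
    current = by_id.get(page_id)
    # The 'seen' set guards against accidental cycles in the metadata.
    while current is not None and current["id"] not in seen:
        seen.add(current["id"])
        trail.append(current["title"])
        current = by_id.get(current["parent_id"])
    return trail[::-1]


def render_tree(children, parent_id=None, depth=0):
    """Return the tree as indented lines, visiting pages depth-first."""
    lines = []
    for page in children.get(parent_id, []):
        lines.append("  " * depth + page["title"])
        lines.extend(render_tree(children, page["id"], depth + 1))
    return lines
```

With the sample data, breadcrumb(pages, 4) yields Home, Guides, Setup, and render_tree(build_children_index(pages)) lists About before Guides because of the page order value.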

The breadcrumb and page tree are part of a SharePoint Framework (SPFx) solution.

The breadcrumb is inserted above every page by an SPFx application customizer extension.

Data is read from SharePoint using PnPjs.

And finally the user interface is a combination of Fluent UI and Material UI.

No additional backend code is involved. This is pure client-side code.

I want such a page tree!

Ok you can have it 😇.

Although it’s more of a demo to see if a page tree in this form proves useful.

Some things could certainly be done feature-wise. It also needs polishing in non-migrated-page contexts.

You could also build one on your own - the metadata is there!

Anyway, let me know if you want to try it. There is no official release, yet.

I’m trying to get the SPFx solution into AppSource, Microsoft’s app store. But I heard that this can be pretty complicated, so I’m not sure if this will succeed. Fingers crossed.

What about those Confluence Macros?

When referring to content migration it’s good to know which content is handled how, and macros are a big part of that.

By Heinrich Ulbricht | Sunday, February 26, 2023

Let’s start with something that has different meanings depending on context.

Content

Page content for space owners might encompass everything on and around a Confluence page: text, macros, attachments, metadata and maybe also comments and permissions.

All those things would ideally be migrated 1:1 to SharePoint. Migrate, turn off Confluence, done.

You can read about different types of Confluence content and how it’s handled by WikiTraccs here: Features.

As you might expect: it’s not that easy.

There are different angles to approach this topic and the different types of content one might expect to be migrated.

For today I want to highlight one type of content that needs awareness and preparation: macros.

What’s in a macro?

Confluence macros can be used to display different types of content.

Macros can aggregate content from other pages. Or they can show search results. They also allow entering unique content, like the Code or Info macro.

When WikiTraccs encounters macros it has different strategies to handle them:

replace by something similar in SharePoint
replace by a static representation (e.g. Page Attachments and Page Tree macros)
replace by its rich content (e.g. Expand macro)
replace by a mere text placeholder
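
One way to picture these strategies is a lookup table with a fallback. The following Python sketch is only an illustration of that idea, not how WikiTraccs is implemented. The macro keys ("attachments", "pagetree", "expand") are hypothetical identifiers chosen for the macros named in the list above.

```python
from enum import Enum


class Strategy(Enum):
    SIMILAR = "replace by something similar in SharePoint"
    STATIC = "replace by a static representation"
    RICH_CONTENT = "replace by its rich content"
    PLACEHOLDER = "replace by a text placeholder"


# Illustrative table. The keys are hypothetical macro identifiers; only
# the macros named as examples in the list above are included, so
# Strategy.SIMILAR has no entry here.
KNOWN_MACROS = {
    "attachments": Strategy.STATIC,
    "pagetree": Strategy.STATIC,
    "expand": Strategy.RICH_CONTENT,
}


def choose_strategy(macro_name):
    """Return the strategy for a macro, falling back to a placeholder."""
    return KNOWN_MACROS.get(macro_name.lower(), Strategy.PLACEHOLDER)
```

Any macro that is not in the table ends up as a text placeholder, which matches what happens to the two unknown macros in the example that follows.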

Let’s look at an example page in edit mode:

The Confluence page above features text and three macros:

the Info macro, containing text that I entered
the Favorite Pages macro that shows my favorite pages
the Lorem Ipsum macro that generates a random paragraph of text

Here’s how this page looks after having been migrated to SharePoint:

This looks pretty similar.

When migrating the page, WikiTraccs retrieved the text, the Info macro with its content, and markers for the other two macros. Because Favorite Pages and Lorem Ipsum web parts don’t exist in SharePoint, they are represented by text placeholders in the resulting page.

Let’s now look at the page in Confluence without being in edit mode:

Confluence knows what Favorite Pages and Lorem Ipsum macros are supposed to do and shows additional, dynamically generated, content.

This, of course, is not possible to replicate in SharePoint.

Now what?

There’s not much WikiTraccs can do about macros not being available in SharePoint.

WikiTraccs already has special handling built-in for a handful of frequently used macros, like the Page Attachments and Page Tree macros.

Read about how WikiTraccs handles the Page Attachments macro here: New attachments macro transformation.

If you’ve got a macro to migrate that you’d like me to take a look at please use any of the Contact options.

For many macros you’ll see a placeholder in the migrated page.

Have a look at the list of known macros and how they are handled here: Known Confluence Macros.

Let’s zoom out for a moment.

It’s best to see this as an opportunity rather than an obstacle. SharePoint is different from Confluence. And as part of Microsoft 365 there is a good chance that something powerful can be built on the Microsoft side of things.

And zooming in again, here are some points for the migration-planning checklist:

create an inventory of spaces and check which can be excluded from the migration
create an inventory of macros that are in use on Confluence pages in to-be-migrated spaces
evaluate how removing macros might affect legibility and understandability of pages
take a look at SharePoint standard web parts as well as community solutions like PnP Samples
think about how the content will be used in SharePoint; is it like an archive or will the content be worked on?
identify use cases that are affected by macros not being available in SharePoint anymore
re-think use cases to SharePoint / Microsoft 365 where needed (don’t forget about Microsoft Teams, Viva, the Power Platform etc.)
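
The macro inventory from the list above can be approximated with a small script. Here is a rough Python sketch that counts macro names in page bodies given in Confluence storage format, where macros appear as ac:structured-macro elements with an ac:name attribute. How you obtain the page bodies is left out, and the sample bodies are invented. A regular expression is a shortcut here, not a full XML parse.

```python
import re
from collections import Counter

# Matches the opening tag of a macro in Confluence storage format and
# captures the value of its ac:name attribute.
MACRO_PATTERN = re.compile(r'<ac:structured-macro\b[^>]*?\bac:name="([^"]+)"')


def count_macros(storage_bodies):
    """Count macro usage across an iterable of storage-format strings."""
    counts = Counter()
    for body in storage_bodies:
        counts.update(MACRO_PATTERN.findall(body))
    return counts


# Invented sample bodies, for illustration only.
sample_bodies = [
    '<p>Hi</p><ac:structured-macro ac:name="info" ac:schema-version="1">'
    "<ac:rich-text-body><p>Note</p></ac:rich-text-body></ac:structured-macro>",
    '<ac:structured-macro ac:name="info"></ac:structured-macro>'
    '<ac:structured-macro ac:name="pagetree" />',
]

for name, count in count_macros(sample_bodies).most_common():
    print(name, count)
```

Sorting by frequency shows which macros deserve attention first when comparing against the list of known macros.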

Something holistic like a transformation workshop or a down-to-earth requirements analysis might be in order. (My colleagues at Communardo can support with those.)

With WikiTraccs it’s free and easy to just test the migration. Looking at the results can spark meaningful discussions as well.

What to expect from WikiTraccs?

So what can you expect from the migration tool WikiTraccs? Or even more general: What to expect from any migration tool?

By Heinrich Ulbricht | Sunday, February 19, 2023

I often demo WikiTraccs to potential customers. Those demos spark discussions around the future structure, navigation patterns, usage scenarios, and more.

These are all topics that need to be worked on in the context of a migration project.

WikiTraccs can support your migration efforts.

It can migrate content from spaces to sites. It can migrate permissions and metadata. It has certain limitations. And it has distinct capabilities.

But there are things that WikiTraccs is not meant to do.

So let’s turn the headline around.

What to not expect from WikiTraccs

The following topics need to be worked on in the context of a transformation workshop, a migration project, or whatever format you choose to generate answers:

choose what to migrate - content only? metadata? permissions?
make decisions about which content goes from where to where - which spaces need to be migrated? which archived? which deleted? which sites do we migrate to? how many are there? how are they created? use new ones or existing ones?
know the amount of content to migrate (at least not in a demo context) - 10, 100 or 1000 spaces? 1000, 10000 or 100000 pages? 10, 100 or 1000 GB of attachments?
plan an access scheme for the SharePoint sites - users, groups, permissions given at which level in the SharePoint hierarchy of elements?
decide which macros are important for users, and which not
choose a strategy for your migration - big bang or in waves? who needs to be informed? how? when?
choose migration environments - need test environments? how many? where do those come from? how exactly do they need to resemble the production environment?
…and more, driven by your use cases

WikiTraccs cannot do this for you.

So what to expect from WikiTraccs?

WikiTraccs will help you realize the plans you made.

Let’s assume the following simplified migration plan:

📄 migrate all Confluence pages, from all spaces (10000 pages overall; 20000 attachments, mostly images)
🏗️ one target site for each source space
👥 keep page author and editor, no permissions
📝 accept that macros will mostly be lost and plan to educate users accordingly
⚙️ run multiple test migrations
📅 choose a weekend for the final migration

Here’s how WikiTraccs has got your back:

📄 WikiTraccs migrates content and metadata
🏗️ WikiTraccs migrates according to your space-to-site mapping
👥 WikiTraccs replaces Confluence user accounts by Azure AD accounts according to your mapping
📝 WikiTraccs handles macros according to the known macros list
⚙️ you test-migrate as often as you need (the license is not bound to volumes of data)
📅 using the experience from multiple test migrations, you verify a migration time of about 2 days
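
How might a test migration back up such a time estimate? A simple way is linear extrapolation, sketched below in Python. The test figures (500 pages in 2.4 hours) are invented for illustration. Real throughput depends on attachments, page complexity, and throttling, so treat the result as a rough first guess only.

```python
def estimate_hours(total_pages, test_pages, test_hours):
    """Extrapolate total migration time linearly from a test migration."""
    if test_pages <= 0 or test_hours <= 0:
        raise ValueError("test_pages and test_hours must be positive")
    return total_pages * test_hours / test_pages


# Hypothetical test run: 500 pages took 2.4 hours.
hours = estimate_hours(total_pages=10000, test_pages=500, test_hours=2.4)
print(f"Estimated duration: {hours:.0f} hours ({hours / 24:.1f} days)")
```

Repeating the test migration with different spaces shows how much the estimate varies.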

A demo is a good way to see this in action.

Testing it yourself is even better. You can quickly put the above steps into action in your environment.

New attachments macro transformation

What’s new regarding page attachments discoverability in SharePoint and the new attachments macro transformation?

By Heinrich Ulbricht | Sunday, January 15, 2023

WikiTraccs migrates a snapshot of the attachments macro

The Confluence attachments macro shows a list of page attachments, which looks like this:

A Confluence page with attachments macro.

WikiTraccs now transforms this macro to a table in the target SharePoint page. The result looks like this:

A SharePoint page with attachment links in a table.

This feels a lot better than the placeholder that was generated before.

And do you see the Page Attachments section at the bottom? Let’s talk about that.

Attachments are also available via SharePoint web part

WikiTraccs creates a Page Attachments section on each SharePoint page with attachments. This allows for easy access to page attachments even on pages that had no page attachments macro.

The section is collapsed by default. Click the section title to expand it:

SharePoint list web part showing page attachments.

This section contains a standard SharePoint list web part. It shows all attachments of the page.

Where are those page attachments even stored??

SharePoint puts files that belong to pages in the Site Assets library. Each page has its own folder. And this is where WikiTraccs puts page attachments when migrating a page from Confluence to SharePoint.

The Site Assets library contains all page attachments.

The web part in a page’s Page Attachments section shows the content of the page’s folder.

Summary

Both the page attachment macro transformation and the list web part that shows page attachments should make working with page attachments a lot smoother. They are an addition to the latest release of WikiTraccs.

Mapping principals and migrating permissions

This blog post covers new features that allow migrating permissions and mapping users and groups from Confluence to SharePoint.

By Heinrich Ulbricht | Sunday, January 01, 2023

This post has been updated in October 2023 to reflect the latest developments.

Terminology

We need to define some terms before starting into the topic.

The term permissions is used broadly to describe which audiences can access which content in Confluence or SharePoint, and to which extent. Those audiences can be users and groups. Users and groups are collectively referred to as principals.

When explicitly talking about permissions for Confluence pages the term page restrictions will be used as this is the correct Confluence terminology.

A sample permission migration

We’ll do a sample permission migration. Screenshots from this migration will be shown throughout the blog post.

For the sake of this sample migration, let’s assume we want to migrate the following Confluence pages:

Confluence Pages

Each of those pages has a different permission configuration. Some pages have view and edit restrictions set. Some for users, some for groups, and some for both. And some pages form a hierarchy to illustrate the challenges associated with that.

How to migrate permissions with WikiTraccs

Opinion from the author of WikiTraccs

If you can, don’t migrate permissions.

Spare yourself the hassle that is migrating permissions from Confluence to SharePoint. It rarely provides value compared to rethinking the information architecture in SharePoint, which will likely yield a different result than what has grown in Confluence. Besides, a 1:1 permission migration is nearly impossible as soon as you’ve got permission hierarchies in Confluence that involve groups.

If you want to migrate permissions, do a test migration first.

If there is a tight deadline and you did not test-migrate, don’t stumble into the production migration. Test-migrate first.

That being out of the way, the rest of this post shows how to migrate permissions.

WikiTraccs can migrate page restrictions from Confluence to SharePoint (within certain limits, more on that later). Migrating page restrictions is done in three steps.

STEP 1: Migrate content and configure mappings for users and groups

First, you migrate page contents. WikiTraccs stores Confluence page restriction and principal information in SharePoint lists, along the way.

Choose content migration mode

In the main blue WikiTraccs.GUI window, from the main menu, choose Settings > Configure Transformation.

In the Transformation Settings dialog, in the Migrations tab, choose Migrate Content as migration mode.

Select Ok to close the dialog.

When a page is migrated, WikiTraccs also receives page restrictions for this page as well as information about users and groups involved in those restrictions.

No permission configuration is applied to SharePoint pages, yet.

After successfully migrating the Confluence pages shown earlier (only content!) this is how the Pages Library of the migration target site looks:

Migrated Confluence Pages in SharePoint Pages Library

In addition, the document library Confluence Page Permission Snapshots (located in the WikiTraccs site) now contains permission snapshots for those pages, waiting to be applied by WikiTraccs:

Confluence Page Permission Snapshots

Those permission snapshots contain information about a page’s restrictions.

Then there’s a third list that WikiTraccs populates, the Confluence User and Group Mappings list (also located in the WikiTraccs site):

Confluence Users and Groups

You now configure the user and group mapping in this list. WikiTraccs needs to know which Confluence principal maps to which SharePoint principal.

STEP 2: Mapping Confluence users and groups to SharePoint

One factor for a successful permission migration is the mapping of Confluence users and groups to their counterparts in SharePoint. You configure this mapping manually.

Let’s configure the mapping like this, in the Confluence User and Group Mappings table (located in the WikiTraccs site):

Mapping Confluence to SharePoint Principals

The highlighted column WT_Setting_MapForPermissions contains accounts from Entra ID and SharePoint groups.

Two mappings deserve particular attention, as we’ll revisit them in STEP 3:

the Admin Heinrich (admin) Confluence account is mapped to two target accounts Wiki SpaceAdmin and Heinrich Admin Ulbricht
the Wiki Reader (wiki.reader) Confluence user is mapped to the Wiki Reader AD account

When you are done with the mapping, proceed with the next step.

Mapping groups

Group mappings are based on SharePoint groups. Due to technical reasons, those SharePoint groups have to be created both in the WikiTraccs site and in the target SharePoint sites.

Group members only have to be configured in the target sites, though. The SharePoint group in the WikiTraccs site is merely a placeholder, so the group can be chosen as migration target in the people picker.

Automating the creation of those SharePoint groups is recommended, e.g. using PnP.PowerShell.
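
PnP.PowerShell is the suggestion above. As an alternative illustration in Python, the sketch below prepares one "create group" call per site against the SharePoint REST endpoint _api/web/sitegroups, without sending anything. Authentication and the actual HTTP client are deliberately left out, and the site URLs and group title are hypothetical.

```python
import json


def build_group_requests(group_title, wikitraccs_site_url, target_site_urls):
    """Describe one 'create SharePoint group' REST call per site.

    The group is needed in the WikiTraccs site (as a placeholder) and in
    every target site. The requests are returned as plain dictionaries;
    sending them with proper authentication is out of scope here.
    """
    headers = {
        "Accept": "application/json;odata=verbose",
        "Content-Type": "application/json;odata=verbose",
    }
    body = json.dumps({"__metadata": {"type": "SP.Group"}, "Title": group_title})
    sites = [wikitraccs_site_url, *target_site_urls]
    return [
        {
            "method": "POST",
            "url": site.rstrip("/") + "/_api/web/sitegroups",
            "headers": headers,
            "body": body,
        }
        for site in sites
    ]


# Hypothetical URLs and group name, for illustration only.
requests_to_send = build_group_requests(
    "Wiki Editors",
    "https://contoso.sharepoint.com/sites/wikitraccs",
    ["https://contoso.sharepoint.com/sites/doc", "https://contoso.sharepoint.com/sites/hr/"],
)
for request in requests_to_send:
    print(request["method"], request["url"])
```

Group members would then be added in the target sites only, as described above.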

Note

You can change the mapping later and run the permission migration again, to update SharePoint page permissions.

STEP 3: Apply permissions to SharePoint pages

In this last step WikiTraccs applies the stored permission information to the already migrated SharePoint pages. This step has to be started manually and separately from the first step.

Enable permission migration mode

In the main blue WikiTraccs.GUI window, from the main menu, choose Settings > Configure Transformation.

In the Transformation Settings dialog, in the Migrations tab, choose Migrate Permissions as migration mode.

Select Ok to close the dialog.

Note

Separating the two steps - retrieving permissions information and applying those permissions to SharePoint - allows for a more flexible migration of content and permissions.

After choosing permission migration mode, start the migration as usual. WikiTraccs now applies the already migrated permission information to the already migrated pages in SharePoint.

We now have a look at a page permission migration result in depth.

In Confluence, the restrictions for page Page with view-only restriction for Wiki Reader look like this:

Restrictions of Page "Page with view-only restriction for Wiki Reader" in Confluence

Here are the resulting permissions for the migrated SharePoint page:

Permissions of the Migrated SharePoint Page

Looking at the principals and permission levels this is what happened:

Confluence account “wiki.reader” was mapped to the “Wiki Reader” AD account and received Read permissions, which correspond to the Confluence view restriction
Confluence account “admin” was mapped to both the “Heinrich Admin Ulbricht” and the “Wiki SpaceAdmin” accounts and received Contribute permissions, which correspond to the Confluence edit restriction
“WikiTraccs Admin” is the migration account and received Full Control permissions (the migration account is always added)

The basis for this permission configuration was laid in STEP 2 when you configured the principal mappings.
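
The result above can be reproduced on paper with a few lines of Python. This sketch is an assumption about the logic, based only on what this post describes: view maps to Read, edit maps to Contribute, one Confluence principal can map to several targets, and the migration account always gets Full Control. The data structures and the handling of unmapped principals are made up for illustration.

```python
# Mapping from the sample migration: Confluence principal -> targets.
PRINCIPAL_MAPPING = {
    "admin": ["Wiki SpaceAdmin", "Heinrich Admin Ulbricht"],
    "wiki.reader": ["Wiki Reader"],
}

# Confluence restriction type -> SharePoint permission level.
PERMISSION_LEVELS = {"view": "Read", "edit": "Contribute"}
LEVEL_RANK = {"Read": 1, "Contribute": 2, "Full Control": 3}

MIGRATION_ACCOUNT = "WikiTraccs Admin"


def resolve_permissions(restrictions, mapping=PRINCIPAL_MAPPING):
    """Turn (confluence_principal, 'view' or 'edit') pairs into a
    dictionary of SharePoint principal -> permission level.

    Returns the assignments and the list of unmapped principals.
    Collecting unmapped principals is an illustrative choice, not
    documented WikiTraccs behavior.
    """
    assignments = {MIGRATION_ACCOUNT: "Full Control"}
    unmapped = []
    for principal, operation in restrictions:
        targets = mapping.get(principal, [])
        if not targets:
            unmapped.append(principal)
            continue
        level = PERMISSION_LEVELS[operation]
        for target in targets:
            current = assignments.get(target)
            # Keep the higher level if a target shows up more than once.
            if current is None or LEVEL_RANK[level] > LEVEL_RANK[current]:
                assignments[target] = level
    return assignments, unmapped


assignments, unmapped = resolve_permissions(
    [("wiki.reader", "view"), ("admin", "edit")]
)
for principal, level in assignments.items():
    print(principal, "->", level)
```

For the sample page this gives Read for Wiki Reader, Contribute for both admin target accounts, and Full Control for the migration account.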

Tracking permission migration progress

WikiTraccs logs the result of each page permission migration in the Page Transactions list, which is located in each migration target site in SharePoint.

For our sample migration this looks like this:

Permission Migration Results

There are several reasons why migrating Confluence page restrictions to SharePoint might fail.

The good thing is that a range of permission migration failures can be fixed easily. The permission migration can be run multiple times and will retry failed migrations. This can be repeated until all resolvable errors have been resolved.

Going back to our sample migration, you might have noticed that (in the last screenshot) two permission migrations were not successful. For two pages the result is “cannot merge hierarchical group restrictions when calculating merged page restrictions”, which means that restrictions could not be migrated for those pages.

The next section takes a deeper look at what WikiTraccs currently can and cannot do when it comes to migrating permissions.

Capabilities and limitations with regard to migrating permissions

WikiTraccs currently has two types of limitations when it comes to permission migrations: not-implemented features and nearly-impossible-to-add features.

But first, here is what WikiTraccs will do:

apply Confluence user and group page restrictions to SharePoint modern pages, when no hierarchy of page restrictions is involved
apply hierarchical page restrictions from Confluence to SharePoint modern pages in certain narrow circumstances (e.g. only users are involved)
provide a mapping opportunity for each Confluence principal to one or more Entra ID accounts and SharePoint groups
map the Confluence view and edit restrictions to corresponding SharePoint permission levels via a hard-wired logic

Features that might be useful in the future, but are not currently implemented:

using other group types than SharePoint groups as mapping target
provide fallbacks for when a mapping is missing or empty
configurable behavior for failed permission migrations (should the SharePoint page be restricted or not?)
more control over the permission migration scope (single pages, spaces, …)
migrating space and global permissions (those are currently not migrated and need to be set manually on the SharePoint site or sub site level)
configuring which SharePoint permission levels are used in the target page (currently the out-of-the-box permission levels are used)
regularly updating SharePoint permissions based on a configurable schedule

Features that will be hard or impossible to add:

migrating arbitrary hierarchies of page restrictions

Nested pages and restrictions

Hierarchical page restrictions from Confluence are difficult to handle because nested pages don’t exist in SharePoint. For SharePoint, the restriction hierarchy of a page needs to be combined and applied to a single SharePoint page.

When only user restrictions are involved then merging a hierarchy of restrictions is relatively easy to do. But when it comes to groups which overlap in only some of their members it becomes nearly impossible to apply that to SharePoint.
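
For the users-only case, here is a minimal Python sketch of what merging can look like. It relies on the Confluence rule that view restrictions are inherited by child pages, so a user must pass every restricted level from the root down to the page. This is an illustration of the idea, not the algorithm WikiTraccs uses, and the user names are invented.

```python
def merge_view_restrictions(chain):
    """Merge view restrictions along a page's ancestor chain.

    'chain' is ordered from the root page down to the page itself. Each
    entry is a set of user names allowed to view that level, or None (or
    an empty set) if the level has no view restriction.

    Returns the set of users who may view the page, or None if no level
    is restricted.
    """
    allowed = None
    for level in chain:
        if not level:
            continue  # unrestricted level, nothing to merge
        allowed = set(level) if allowed is None else allowed & set(level)
    return allowed


# Invented example: grandparent restricted to alice and bob, parent
# unrestricted, page itself restricted to bob and carol.
print(merge_view_restrictions([{"alice", "bob"}, None, {"bob", "carol"}]))
```

Only bob passes both restricted levels. With groups, the same intersection would require a group that contains exactly the overlapping members, and such a group usually does not exist in SharePoint. That is where merging breaks down.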

Re-apply permissions after changing principal mappings

You can change the principal mapping that you configured in STEP 2, even during or after a permission migration.

To do so, proceed as follows:

in the WikiTraccs SharePoint site, change principal mappings via the Confluence User and Group Mappings list
in each migration target SharePoint site, delete entries from the Page Transactions table for pages that should have their permission configuration updated; otherwise WikiTraccs won’t update those pages
in WikiTraccs, choose Permission Migration mode and start the migration

WikiTraccs will now apply page permissions again, using the already migrated information about Confluence page restrictions, and the updated principal mapping.

Note

This does not update the already migrated page restriction information from Confluence. To get updated page restriction information from Confluence, you have to restart with STEP 1 and remigrate page contents, which will carry over updated page restriction information from Confluence.

Summary

With the next release WikiTraccs adds the ability to migrate page restrictions from Confluence to SharePoint.

You’ll be able to configure which Confluence users and groups correspond to which SharePoint users and groups. WikiTraccs uses this information when applying unique permissions to SharePoint pages.

This release marks a milestone from which further development will be driven by your feedback. WikiTraccs could support several additional scenarios and development effort will be directed to areas with the most demand.

Furthermore, user and group mappings will be the basis for the migration of page metadata like author and editor.

WikiTraccs Quick Start Video out now!

By Heinrich Ulbricht | Sunday, November 27, 2022

You can have your first migration running in less than 30 minutes.

This quick start video is all you need:

Download FREE WikiTraccs now

New release - UI overhaul

This covers the latest release 0.0.222 and its changes.

By Heinrich Ulbricht | Sunday, November 20, 2022

Note

Click here to get the latest release. The Quick start guide has been updated to reflect below changes.

The user interface got some additions

The WikiTraccs.GUI user interface has been changed.

The settings have been grouped into steps: First configure the source, then configure the target, then test the configuration, and then start the migration.

Some configuration options are gone. You now use a SharePoint list to configure the Confluence spaces to migrate. This list shows basic information about all Confluence spaces and is filled by WikiTraccs.

Note: The idea behind using SharePoint lists for configuration has been described in more detail here: A new approach to configuring WikiTraccs

The progress indicators are a big improvement. You no longer have to rely on the console output of WikiTraccs to see progress. Instead, you get a progress bar for overall page progress and for the progress of each Confluence space being migrated:

Migration progress

Laying the foundation for user and group mapping, and permission migration

Starting with the current WikiTraccs release 0.0.222, you need to create a SharePoint site dedicated to WikiTraccs. This is the “WikiTraccs site”. You use this site to configure WikiTraccs, like choosing the spaces to migrate. And WikiTraccs uses the site to store metadata about the migration.

This site also holds two SharePoint lists, Confluence User and Group Mapping and Confluence Permission Snapshots. You will notice that those will be filled while migrating.

The current release does not yet support user or group mappings, or permission migration. But a future release might make use of this data.

Fixes and improvements

When migrating pages with large file attachments WikiTraccs could run into timeouts. This has been fixed.

Parallel file downloads have been added. WikiTraccs now loads two attachment files from Confluence at the same time, optimizing for large and small files.
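
The general pattern of bounded parallel downloads can be sketched in a few lines of Python. This is not the WikiTraccs implementation. The download function is passed in as a parameter and the file names are placeholders.

```python
from concurrent.futures import ThreadPoolExecutor


def download_all(attachment_urls, download_one, max_parallel=2):
    """Download attachments with at most 'max_parallel' running at once.

    'download_one' is any callable that takes a URL and returns the
    downloaded content. Results come back in input order.
    """
    with ThreadPoolExecutor(max_workers=max_parallel) as pool:
        return list(pool.map(download_one, attachment_urls))


def fake_download(url):
    """Stand-in for a real HTTP download, for demonstration only."""
    return f"content of {url}"


print(download_all(["a.png", "b.pdf", "c.docx"], fake_download))
```

Keeping the limit at two means a single large file cannot block all progress, while the source system is not flooded with requests.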

The list of known edge cases in Confluence page content transformation has been expanded.

The retrieval of Jira issue information has been improved for pages with a large amount of linked Jira issues.

Summary

This release is a big step and lays the foundation for adding two big missing features: user and group mapping, and permission migration. I’m excited for the next releases as they will bring WikiTraccs closer to what one would expect from a migration tool.

I’m also eager to get feedback, so get in touch when you try it: Contact.

Measuring page migration success

This post is about the metrics that WikiTraccs provides about migrated Confluence pages.

By Heinrich Ulbricht | Saturday, October 22, 2022

Which metrics measure page migration success?

Each Confluence page consists of content, formatting, layout, attachments, links, mentions and more that has to be handled when transforming the page to SharePoint.

WikiTraccs calculates the following metrics for each migrated page:

the percentage of characters in the page that was migrated
the number of unknown macros
the number of links that could not be resolved
the number of migrated images
the number of user references in the page that could not be resolved
the number of transformation errors

Those numbers are saved as page metadata.

A SharePoint view for the Site Pages library can be used to visualize those numbers. Here is a migration result indicating success:

Successful page migrations

WikiTraccs creates a view Recent Pages (WikiTraccs) that shows all this information.

Source content issues

The image below shows the result for a page where not everything could be migrated. Not all images could be transferred. And there were two links to inaccessible pages:

Missing images and inaccessible pages

Errors like missing images and inaccessible content can often be solved by fixing permissions or broken content in the source Confluence system.

Also note the “3” in the “Crystal Ball Transformations” column. This means there were 3 macros that WikiTraccs doesn’t know yet. It used a generic transformation approach for those macros.

Note

See the list of known macros here: Known Confluence Macros.

Transformation issues

Sometimes WikiTraccs cannot handle content properly (for whatever reason):

Migration errors

The image above shows that 12 elements in a page could not be transformed and only 51% of the page’s text content made it to the SharePoint page.

Those issues can be caused by faulty Confluence pages triggering transformation errors. There are already dozens of special cases and quirks that WikiTraccs works around. But there will always be cases that are new and need to be looked into.

For such a page, click on “View Entries” to see the content that causes the transformation issue. Ultimately, a good thing to do might be opening an issue on GitHub.

Summary

In this post we looked at the metrics WikiTraccs uses to highlight migration success and migration issues. Everything being green is a good indication of a successful migration.

WikiTraccs counts content characters in each Confluence page and counts them again in the target SharePoint page. The result is reported as “Text Transferred Percent”. This percentage is a good indicator of whether content is missing.
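
As a rough illustration of such a metric, here is a Python sketch. How WikiTraccs counts characters exactly is not described here, so ignoring whitespace and capping the value at 100 are assumptions of this example.

```python
def count_content_chars(text):
    """Count characters, ignoring whitespace (an assumption of this sketch)."""
    return sum(1 for ch in text if not ch.isspace())


def text_transferred_percent(source_text, target_text):
    """Return how much of the source text arrived in the target, in percent."""
    source_count = count_content_chars(source_text)
    if source_count == 0:
        return 100  # nothing to transfer, so nothing is missing
    target_count = count_content_chars(target_text)
    return min(100, round(100 * target_count / source_count))


print(text_transferred_percent("abcd efgh", "abcd"))
```

Here only four of eight content characters arrive, so half of the text is reported as transferred.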

A new approach to configuring WikiTraccs

This post is about configuring WikiTraccs with SharePoint lists, a different approach to the GUI-based one planned so far.

By Heinrich Ulbricht | Sunday, October 16, 2022

So far my plan was to establish WikiTraccs.GUI as a visual frontend for configuring WikiTraccs.Console.

Note

See WikiTraccs GUI vs. WikiTraccs Console to learn the difference between those two.

But I decided to change that approach a bit. WikiTraccs.GUI will still be the application you start transformations with. But the configuration will mainly be done in SharePoint.

How is SharePoint used to configure WikiTraccs?

WikiTraccs stores information about the migration into SharePoint lists. You will create a site collection for WikiTraccs to use. This could be called “WikiTraccs Store” and holds data during the migration. It can be deleted after the migration.

When starting WikiTraccs it will create several lists there. One is the Confluence Space Inventory list. This list stores information about all spaces in Confluence, like space name and space key. But it also allows you to select the spaces to be migrated:

Confluence Space Inventory List in SharePoint

The image above shows the Confluence Space Inventory list with some spaces from a publicly available Confluence instance.

On the right side of the image, marked with a red rectangle, are two columns that allow you to:

mark a space for migration via a Yes/No column
specify the target site for the migration via a text column to put the target site URL in

More columns might follow as needed. (Note: the column names are still technical and are yet to be named properly.)
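
To illustrate how the two columns could drive a migration, here is a minimal sketch that filters inventory rows. The field names `migrate` and `target_site_url` are hypothetical stand-ins for the Yes/No column and the text column (the real column names are still technical, as noted above), and reading the rows from SharePoint is left out.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class SpaceInventoryRow:
    space_key: str
    space_name: str
    migrate: bool          # hypothetical name for the Yes/No column
    target_site_url: str   # hypothetical name for the target site text column


def spaces_to_migrate(rows: List[SpaceInventoryRow]) -> List[Tuple[str, str]]:
    """Return (space key, target site URL) for rows marked for migration."""
    return [
        (row.space_key, row.target_site_url.strip())
        for row in rows
        # A row without a target site cannot be migrated, so it is skipped.
        if row.migrate and row.target_site_url.strip()
    ]


if __name__ == "__main__":
    inventory = [
        SpaceInventoryRow("DOC", "Documentation", True, "https://contoso.example/sites/doc"),
        SpaceInventoryRow("OLD", "Archive", False, ""),
        SpaceInventoryRow("HR", "Human Resources", True, ""),
    ]
    for key, url in spaces_to_migrate(inventory):
        print(key, "->", url)
```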

Why use SharePoint for configuration?

One benefit of using SharePoint lists for configuration is that they already provide a way to display tabular data, like the space inventory. This data can be sorted and filtered out of the box.

List data can be exported to and imported from Excel files for others to review. So the list of spaces could be given to stakeholders to decide which spaces should be migrated. Then the list is imported again and WikiTraccs migrates only the selected spaces.

Using SharePoint as a user interface also means there is less potential for programming errors in WikiTraccs since the interface is owned by Microsoft.

SharePoint lists provide a familiar user interface, if users are already used to working with SharePoint. There is no new UI that has to be learned.

The data can be shared with select stakeholders by using SharePoint list or list item permissions.

You are free to run WikiTraccs on any machine, even multiple ones, and change the machine at will. Every instance of WikiTraccs will operate on the same configuration.

And last but not least you have the SharePoint Search at your disposal to locate information you need.

Wrap up

That’s it for the latest update. What do you think about using SharePoint lists for configuring a migration? Get in contact and let me know!

Stay in the loop!

Are you interested in using WikiTraccs once it leaves early access? Drop me a direct message on X @wikitraccs or via email and I’ll get back to you as soon as it’s ready.

There are more lists that are relevant to WikiTraccs that will be covered in future blog posts:

Confluence User and Group Mapping
Confluence Permission Snapshots

As the names suggest those are relevant for mapping Confluence users and groups to SharePoint and to ensure the permissions and metadata can be rebuilt properly.

Note

The features described in this blog post are not yet included in release 0.0.206 and will be part of an upcoming release.

Announcing WikiTraccs

WikiTraccs migrates content from Confluence to SharePoint Online

By Heinrich Ulbricht | Saturday, October 01, 2022

I’m delighted to announce the first public release of WikiTraccs.

What is WikiTraccs?

WikiTraccs is a migration tool for Confluence to SharePoint Online migrations.

Tip

Want to try WikiTraccs right away? Try the quick start guide.

Which problem does it solve and who needs it?

WikiTraccs makes migrating content from Atlassian Confluence to SharePoint Online easier. It contains hundreds of transformation rules for a wide range of Confluence formattings, structures and macros.

A migration from Confluence might be beneficial for:

users of Confluence Server, since Atlassian will eventually phase out Server, urging users to migrate to Confluence Data Center or Confluence Cloud
users of Confluence Data Center, because it only buys some time before migrating to the Atlassian cloud might be inevitable
users of Confluence Cloud, because of license costs

Often Microsoft 365 licenses are available already, covering the use of SharePoint Online modern pages to display wiki content. In this case SharePoint Online is a potential target for the migration.

Is it ready for production?

No, not yet. The current release is an early access version. This means the basics work but a lot is still to be done.

The focus of this first release is to migrate content and structure of Confluence pages as reliably as possible. This includes:

Text
Formatting like bold, italics, underline etc.
Text colors and background colors
Headings
Task lists
Emoticons
Complex tables
Images, internal and external
Mentions
Page and attachment links, on the same page and to different pages and spaces
Layouts with multiple columns and rows
Macros - a wide range is supported out of the box, for unknown ones generic rules are applied

It has been tested on thousands of Confluence pages and dozens of gigabytes of images and attachments. Since every Confluence is unique with regard to its content there will be cases not covered. But those will be covered over time as feedback comes in.

WikiTraccs keeps track of every word that is present in a source Confluence page and checks if it is present in the target SharePoint page as well. It highlights pages where this is not the case. Thus pages with lacking transformation success can be identified easily and immediately.

The Feature Overview page will eventually cover all supported and not yet supported types of Confluence content and metadata.

You can also read more about how WikiTraccs works in How does WikiTraccs work?.

Which Confluence versions are supported?

WikiTraccs supports Confluence 6 and up, including Confluence Cloud. Confluence 5 and older are not supported.

How do I get started?

Have a look at How to get started?

If you want to get in touch, have a look at the options you have for logging issues, starting discussions, or just giving feedback.

What’s the meaning of “WikiTraccs”?

“WikiTraccs” is short for “Wiki Transformation Accelerators”. WikiTraccs for Confluence should indeed accelerate things compared to a manual migration.

There are a lot of interesting topics to be covered, which will be done in future posts!

New Releases

WikiTraccs 1.27.16

By Heinrich Ulbricht | Wednesday, May 21, 2025

Details

new: when a space homepage is migrated the site homepage will be set to this page (#99)
improved: hyperlinks stuck together if there was no space in-between; a space character is now inserted to create a visual gap
improved: “unmigrated-wiki-markup” macro content is now formatted better (although it might still look different from Confluence as Confluence seems to interpret and render the old wiki markup, which SharePoint doesn’t do)
improved: longer streaks of empty paragraphs (which are not visible in Confluence, but in SharePoint Online) are shortened, to fix large gaps
improved: block quotes that seem to contain source code are transformed to monospace-formatted paragraphs instead, to look like source code in SharePoint
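
As an illustration of the hyperlink improvement listed above, the following sketch inserts a space between two links that directly follow each other in an HTML fragment. It is a simplified, regex-based stand-in and not how WikiTraccs does it internally; the function name is made up.

```python
import re

# Matches a closing anchor tag that is immediately followed by an opening one.
ADJACENT_LINKS = re.compile(r"</a>(?=<a\b)", flags=re.IGNORECASE)


def separate_adjacent_links(html: str) -> str:
    """Insert a space between hyperlinks that are stuck together."""
    return ADJACENT_LINKS.sub("</a> ", html)


if __name__ == "__main__":
    fragment = '<p><a href="/page-a">Page A</a><a href="/page-b">Page B</a></p>'
    print(separate_adjacent_links(fragment))
```

Links that already have whitespace or text between them are left untouched because the lookahead only matches a directly following `<a`.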

Prerelease Evolution

v1.27.0 → v1.27.7

new: migrating Comala Document Management metadata (workflow name, state, due date) as “labels” so they are available in SharePoint Online (note: only Comala Document Management on-premises is supported, not Confluence Cloud)
new: all pages of a draw.io diagram can now (optionally) be exported as image to SharePoint Online; read details in the blog post: WikiTraccs Creates Images for Draw.io Pages
improved: labels of the “content by label” macro configuration are now made available as dedicated macro property to aid macro templating (see \Templates\Transformation\off_contentbylabel.hbs for a sample)
improved: updated table style to latest SharePoint Online state

v1.27.7 → v1.27.8

improved: when a draw.io macro links to an old diagram revision, a note is now placed above the preview image and the page images section noting that the images show the latest version

v1.27.8 → v1.27.13

new page editor format (CK5) is now the DEFAULT for new WikiTraccs deployments; needs to be enabled in the settings for existing deployments
new: exporting Table Chart macro (by Stiltsoft) as image
improved: transformation of complex nested Aura macros (panels, tabs, etc.); still lots of room for improvement, but then there is only so much you can do in SharePoint pages
fix: a draw.io macro in a section created by a tab macro could split the section and the section title disappeared
fix: in Confluence Cloud switched back to v1 content export endpoint since v2 is somewhat unreliable
fix: BPMN Modeler macro transformation to image failed when macro was pulled in from another page via Excerpt Include
fix: overly long transformation log could prevent page creation in SharePoint (hitting the 2 MB limit)

v1.27.13 → v1.27.15

new: transforming Display Metadata macro (by Communardo) so that the metadata is shown on the SharePoint page
improved: some adjustments to @-mention transformation

v1.27.15 → v1.27.16

new: migrating page metadata by the Metadata for Confluence app (by Communardo); metadata will be migrated as page labels (note: only on-premises is supported, not Confluence Cloud)

Note: I recommend opting in to the new (prettier) SharePoint page format, if you have not done so already.

WikiTraccs 1.26.18

By Heinrich Ulbricht | Monday, April 07, 2025

Macro and Transformation Template Enhancements

new: transformation of MultiExcerpt macros (by Appfire):
- Multiexcerpt
- Multiexcerpt include
- note: one restriction: Multiexcerpt include macros that use the useIncluderPageContext parameter are not supported (yet)
new: supporting additional emoji types, the ones chosen from the exhaustive emoji picker
new: generating web parts from macros is now possible, tutorial videos for two web parts have been added: Convert Macros to SharePoint Web Parts
improved: macro transformation templates have more capabilities, the documentation has been updated
improved: when starting WikiTraccs.GUI, it will now check if there are macro transformation templates that cannot be processed; an error dialog will be shown in that case
improved: faster handling of large spaces (15,000 pages and up)
- WikiTraccs now uses the fast page retrieval endpoint that is available as of Confluence 7.18 (closes #69)
- this makes getting the list of pages per space much faster (tested with up to 40,000 pages per space)
improved: WikiTraccs now explicitly logs if the preview image of a draw.io diagram is missing
- note that the missing preview image continues to be logged as missing image (in general) and that the KPI columns in the Site Pages library indicate missing images as well
fix: lingering bug that could cause SharePoint page creation to fail intermittently

Prerelease Evolution

v1.26.1 → v1.26.5

new: supporting additional Mosaic table macros; tables should now mostly be transformed to proper tables in SharePoint
- thead
- tbody
- th
improved: adding a workaround for a Confluence Server issue where the server returns server error 500 when trying to retrieve attachment information; now a CQL-based alternative query will be used to (hopefully) work around such issues

v1.26.5 → v1.26.6

improved: login detection for Confluence Interactive authentication mode should now better handle additional MFA pages in the login flow that are inserted by SSO solutions
preparations for link fixer implementation (#152), see details in #154 - note that this will trigger a “preparation” of the WikiTraccs and all migration target sites, as additional fields are added to lists (a one-time operation per site that will be run automatically by WikiTraccs and that will take about a minute)

v1.26.6 → v1.26.9

improved: draw.io macro transformation
- Embed draw.io macros that reference the source draw.io macro via URL (instead of page ID) are now properly transformed
- improved results when the source page (with draw.io macro) and consumer page (with Embed draw.io macro) are migrated to different SharePoint target sites
- for Embed draw.io macros, a link to the source page that contains the draw.io macro will be added (in case users need to access the original macro files)
- improved handling of draw.io macros that have a display name set
- handling of macros with missing aspectHash parameter which is required to find the corresponding preview image attachment
improved: some paragraph formatting improvements where the margin was too low

v1.26.9 → v1.26.11

new: WikiTraccs can bring missing draw.io preview images back, see the blog post and the video: WikiTraccs Creates Draw.io Preview Images
new: logging of current memory load in WikiTraccs Console and GUI; when diagnosing unexpected migration interruptions this will hopefully provide useful insights
improved: handling of broken tables

v1.26.11 → v1.26.13

improved: reduced memory footprint of draw.io preview image generation
fix: draw.io preview image generation failed for diagrams with megabytes of embedded images

v1.26.13 → v1.26.15

improved: when creating draw.io preview images, WikiTraccs now tries to detect if external resources (images, fonts, …) cannot be downloaded and writes a warning to the log file if that is the case
fix: macro ignore list wasn’t applied to all macros

v1.26.15 → v1.26.18

new: supporting BPMN Modeler Enterprise macro (by viadee Unternehmensberatung AG) (#158); an image will now be shown where the diagram used to be, instead of a text placeholder

Note: I now recommend opting in to the new page format, if you have not done so already. One of the next releases will have this on by default.

WikiTraccs 1.25.11

By Heinrich Ulbricht | Thursday, February 27, 2025

More Macros Release

This release focuses on supporting many more popular macros:

new: transformation of Mosaic macros (by Adaptavist)
- Horizontal Navigation Tab + Pages (if not nested in e.g. section or table)
- Background
- Table, Table Row, Table Cell
- Bibtex Reference
new: transformation of AURA macros (by Seibert - appanvil)
- Tab Group + Tab (if not nested in e.g. section or table)
- Background
- Cards
- Panel
- Button
- Title
new: transforming the Navitabs macro (by Communardo Products GmbH)
new: transforming the Content Report Table macro using the static snapshot approach
new: transforming the Content By Label macro using the static snapshot approach
new: transforming the Excerpt Include macro; content will be copied and marked as copy
improved: hard link transformation now handles relative hard links
improved: table content is now vertically top-aligned (non-standard table transformation needs to be active)
improved: added hidden page properties macro to default macro ignore list
improved: macro transformation templates have more capabilities, more details will follow
improved: styling improvements (new page format only)
- apply overridden link colors, better first-heading margins
- wider margins between paragraphs and lists, tables, and images
fix: Table of Contents references to headings containing “@” were broken

WikiTraccs 1.24.20

By Heinrich Ulbricht | Sunday, February 09, 2025

Page Formatting and Styling Upgrade

This release brings rather large improvements to page formatting and styling (note: opt-in, has to be activated).

WikiTraccs can now create pages in a newer SharePoint page format that provides more styling capabilities.

Learn how to opt-in and more in the blog: Harnessing the New SharePoint Page Format

Immediate advantages of the new page format:

new: migrating exact text colors from Confluence to SharePoint
improved: using one of the new citation styles that looks more appealing (left-aligned text with a left-side bar instead of italics and centered)
improved: better paragraph margins

Other Improvements

Confluence On-Premises & Cloud

new: transform draw.IO Board/Sketch macro
new: Roadmap macro transformation to proper image
new: enabling non-standard table transformations by default, since merged columns and table cell colors just look too good to miss out on them
new: migrating status metadata for pages; status can be “current” or “archived”
improved: beautified info/warning/note/tip macro transformation, leveraging table cell background colors
improved: table transformation (better column width calculation)
improved: adding additional hard link formats
improved: whitespace cleanup; now leaving more gaps between paragraphs where this might’ve been intentional
fix: handling of attachments which only differ in ß and ss since SharePoint treats those as equivalent
fix: image title not set after transformation in Confluence 8
fix: for some Confluence instances, comments migration failed with 500 server error; a workaround has been implemented
fix: content selector preference issue when using a combination of content ID and space selectors, when a page is covered by both; content ID selector will now take precedence
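
The ß/ss fix above deals with attachment names that are distinct in Confluence but collide in SharePoint. A minimal sketch of detecting such collisions is shown below. It uses Python's `str.casefold()`, which maps ß to ss, as an approximation; whether this matches SharePoint's exact comparison rules is an assumption, and the function name is made up.

```python
from collections import defaultdict
from typing import Dict, List


def find_equivalent_names(file_names: List[str]) -> List[List[str]]:
    """Group file names that become identical after case folding."""
    groups: Dict[str, List[str]] = defaultdict(list)
    for name in file_names:
        # casefold() lowercases and also turns "ß" into "ss".
        groups[name.casefold()].append(name)
    return [names for names in groups.values() if len(names) > 1]


if __name__ == "__main__":
    attachments = ["Straße.png", "Strasse.png", "plan.pdf"]
    for group in find_equivalent_names(attachments):
        print("would collide:", group)
```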

Confluence Cloud

new: added a setting to switch off migration of archived pages

Prerelease-Specific Evolution

v1.24.4 → v1.24.5

improved: normalization of more “empty paragraph” variants which carry over from Confluence and often cause larger gaps in the page
fix: code macro embedded in single table cell with no other text can be missing in pages of new page format (note: detected by page KPI “Text Transferred Percent” being below 100%)

v1.24.5 → v1.24.7

fix: “Text Transferred Percent” KPI sometimes miscalculates the title characters of images, Gliffy, and draw.io diagrams
fix: wrong image size for draw.io diagrams when the image size has decimal places (like 400.5) and the local system uses , as decimal character
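
The decimal character issue above is a classic locale pitfall. The sketch below contrasts locale-style parsing with invariant parsing. `parse_localized` is a deliberately naive, made-up helper that mimics a parser configured for a comma as decimal character; it is not taken from WikiTraccs.

```python
def parse_localized(value: str, decimal_char: str) -> float:
    """Mimic locale-sensitive parsing.

    Only `decimal_char` counts as decimal separator; the other of "." and ","
    is treated as a grouping character and dropped.
    """
    group_char = "." if decimal_char == "," else ","
    return float(value.replace(group_char, "").replace(decimal_char, "."))


def parse_invariant(value: str) -> float:
    """Parse a machine-written number; float() always expects "." as separator."""
    return float(value)


if __name__ == "__main__":
    width = "400.5"  # size as written in the diagram data
    print(parse_localized(width, ","))  # misreads the "." as grouping character
    print(parse_invariant(width))       # independent of the local system settings
```

The takeaway is that numbers written by a machine should be parsed with fixed, culture-independent rules, whatever the local system uses for display.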

v1.24.7 → v1.24.9

fix: content of multiexcerpt macro not migrating when it contains only plain text (note: detected by page KPI “Text Transferred Percent” being below 100%)
fix: rare case of image being missing on page when it is the only content in a page section (note: detected by page KPI “All Images Transferred” being false)

v1.24.9 → v1.24.10

fix (Cloud): whiteboard image can be missing with new page format
fix (Cloud): Page Tree macro transformation did not resolve links to non-page content types

v1.24.10 → v1.24.12

improved: transformation of Include Shared Block macros
- page links and image references in included shared blocks are updated to still point to their valid location
- including shared blocks without specifying a key now works, as long as the source page only contains one shared block

v1.24.12 → v1.24.17

new: adding text background color conversion for new page format
improved: handling of broken HTML tables
improved: additional emoji transformations (flag, red minus)
fix: in the new page format, inline images that are followed by empty paragraph might be missing (note: detected by page KPI “All Images Transferred” being false)

v1.24.17 → v1.24.18

new: adding PlantUML macro transformations (plantuml, flowchart, spacegraph, linkgraph, database-structure, database-info)

v1.24.18 → v1.24.20 (available as of 2025-02-25)

new: additional PlantUML macro transformations (plantumlrender, flowchartrender)
improved: table of contents transformation now handles numbered TOC entries when matching with unnumbered headings
fix: some character count edge cases
fix: extended user info lookup could fail on Confluence 9 blocking page migration
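
Matching numbered TOC entries with unnumbered headings, as mentioned in the list above, could look roughly like this. The regular expression and function names are illustrative assumptions; the real matching rules in WikiTraccs may differ.

```python
import re
from typing import List, Optional

# Leading outline numbers such as "1 ", "1. " or "2.3.1 ".
LEADING_NUMBER = re.compile(r"^\s*\d+(?:\.\d+)*\.?\s+")


def normalize_heading(text: str) -> str:
    """Drop a leading outline number and compare case-insensitively."""
    return LEADING_NUMBER.sub("", text).strip().casefold()


def match_toc_entry(toc_entry: str, headings: List[str]) -> Optional[str]:
    """Return the first heading that matches the TOC entry, if any."""
    wanted = normalize_heading(toc_entry)
    for heading in headings:
        if normalize_heading(heading) == wanted:
            return heading
    return None


if __name__ == "__main__":
    page_headings = ["Overview", "Installation", "Troubleshooting"]
    print(match_toc_entry("1.2 Installation", page_headings))
```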

Why so many iterations? The new page format is a rather big change, that’s why WikiTraccs is test-migrating content every day to discover lurking issues. Those are addressed along the way, while adding new transformations as well.

WikiTraccs 1.23.16

By Heinrich Ulbricht | Monday, January 27, 2025

Improvements

Confluence On-Prem & Cloud

new: WikiTraccs migrates macros from the Include Content app by Keysight Technologies
- the shared block macro content is now migrated as is, without surrounding text placeholders
- the include shared block macro is now migrated as copy of the referenced “shared block” macro, instead of just a text placeholder
improved: user @-mention link now contains user reference (like Confluence key or username) that can be used to identify and update the link in the future
improved: connection check for chromedriver site got a retry as it sometimes failed with server error
improved: monospace formatting is now preserved even when being on one line with differently formatted text
fix: the cookies.txt authentication workaround was not compatible with Confluence Cloud; this has been fixed and the documentation has been updated as well
fix: user @-mention link in page not properly generated if there is no email address present for Confluence user (even when mapping to Entra ID user is present that would provide email); now the Entra ID user’s email is used when generating the link
fix: edge case where a user mapping would not be applied if the local cache contained the Confluence user’s email, but the user mapping list did not

Confluence On-Prem

improved: retrieval of Confluence user’s email address from Confluence via alternative endpoints; this improves auto-mapping to Entra ID accounts

Confluence Cloud

fix: whiteboard import working again; needed to be adjusted to changed HTML structure

Preparations for New Page Editor Version Support

This release contains preparations to create pages using the latest version of the page editor as this feature should now be fully rolled out to all tenants by Microsoft.

Background: Microsoft started updating its page editor around the end of 2023 and continued improving it throughout 2024. Pages that are or have been created in the previous format are updated as soon as they go into edit mode the first time. You see that with migrated pages as there will be a small delay the first time you edit a page. This is JavaScript running and upgrading the page to the latest version.

WikiTraccs will soon start generating pages with the latest version. Going forward, this makes it possible to leverage the full potential of additional formatting options and new web parts.

You shouldn’t notice any changes, yet. The switch will be announced and there’ll be a toggle button to switch back if necessary.

Progress is tracked here: #148

New: WikiTraccs for Markdown

WikiTraccs gains the ability to publish Markdown files to SharePoint. This release includes an early implementation to gather feedback.

Join the waitlist and receive updates.

Read the blog post to learn how to test: Testing WikiTraccs for Markdown

WikiTraccs 1.22.11

By Heinrich Ulbricht | Monday, December 16, 2024

New: Live Progress Indicators

This release introduces an often asked-for feature: live metrics that show the current migration speed, which can also indicate how long the migration will take.

Learn more in the documentation: Live Progress Indicators.

And here’s a blog post showing some test migrations and their metrics: How much time will a Confluence to SharePoint migration take?

Note: metrics will be shown in “migrate content” mode, the other modes are not yet supported.
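
How a migration speed can translate into a remaining-time estimate is plain arithmetic, sketched below. This assumes a constant speed and is not the formula WikiTraccs uses; the function name and the sample numbers are made up.

```python
from typing import Optional


def estimate_remaining_minutes(
    total_pages: int, migrated_pages: int, elapsed_minutes: float
) -> Optional[float]:
    """Extrapolate the remaining time from the average speed so far."""
    if migrated_pages <= 0 or elapsed_minutes <= 0:
        # No speed can be derived yet.
        return None
    pages_per_minute = migrated_pages / elapsed_minutes
    return (total_pages - migrated_pages) / pages_per_minute


if __name__ == "__main__":
    # 250 of 1000 pages migrated in 10 minutes means 25 pages per minute.
    print(estimate_remaining_minutes(1000, 250, 10))
```

Real migrations rarely run at constant speed (page size and attachments vary), so such an estimate is only a rough indication.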

New Features & Improvements

Confluence On-premises & Cloud

new: the number of pages for a space (or more broadly: selector) and the number of migrated pages are written back to the Space Inventory SharePoint list, making it easier to see progress per selector
- look at the WT_In_CfPageCount and WT_Out_CfTransformedPageCount columns (note: you might have to add these to the list view)
- the values are updated once when starting and once when finishing the migration, at the same time the progress log files are written
new: adding support for the Children Display macro; it is now transformed as static snapshot as well (like the Table of Contents and Page Tree macros)
- note that the snapshot will always show links, no page previews
new: adding support for colored and merged table cells in SharePoint; read the blog post: Making SharePoint tables look pretty.
new: warnings and errors are saved to a separate log file, making analysis easier (note: this is a copy, they also can be found in the big common log file)
improved: supporting additional page hard link format for link transformation (homepage link)
improved: supporting additional space hard link format
fix: connection test fails for anonymous authentication mode on some sites, as the JSESSIONID cookie seems to be missing
fix: space hard link transformation
fix: “update author and editor” mode was broken due to header-related assumptions introduced in release v1.21.11

Confluence Cloud

improved: hard link transformation now transforms links to new content types (whiteboard, …)
improved: modification dates for new content types (whiteboard, …) are now retrieved as well and used to detect changes on consecutive migration runs (delta scenario)
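
The delta scenario mentioned above boils down to comparing modification dates between runs. Here is a minimal sketch of that decision. Where the previous date is stored and how it is retrieved are left out, and the function name is hypothetical.

```python
from datetime import datetime, timezone
from typing import Optional


def needs_migration(
    source_modified: datetime, last_migrated_modified: Optional[datetime]
) -> bool:
    """Decide whether content changed since the previous migration run."""
    if last_migrated_modified is None:
        # Never migrated before, so it always needs migration.
        return True
    return source_modified > last_migrated_modified


if __name__ == "__main__":
    previous = datetime(2024, 12, 1, 8, 0, tzinfo=timezone.utc)
    current = datetime(2024, 12, 10, 9, 30, tzinfo=timezone.utc)
    print(needs_migration(current, previous))
```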

WikiTraccs 1.21.11

By Heinrich Ulbricht | Tuesday, November 26, 2024

WikiTraccs switches to Confluence Cloud REST API v2 (mandatory as of March 31, 2025; originally December 2, 2024)

Note: This topic is only relevant when migrating from Confluence Cloud. Confluence Server and Data Center are not affected.

WikiTraccs switches over from Confluence REST API v1 (“on-prem API”) to v2 (“cloud API”) which is a pretty big change (#118). Atlassian is removing critical v1 endpoints on Dec 2, 2024 and forces app vendors to use v2, although this new version requires significantly more API calls (which makes it slower) and is more cumbersome to use.

WikiTraccs has been undergoing intensive v2 adjustment and testing over the last couple of months. So far it used mainly v1 endpoints but now switches to v2.

Ideally, there should be no differences for users of WikiTraccs.

Here’s the announcement in the blog: WikiTraccs switches to Confluence Cloud v2 REST API.

Note: Atlassian extended the deadline one day after this release.

WikiTraccs brings the margins back to text web parts

SharePoint pages are now created in a way that brings the margins back to text web parts.

Some time in 2024 Microsoft changed something in text web parts. Margins between paragraphs disappeared gradually. Now the margins are nearly always missing, making paragraphs of text stick together more than they should. You’ll notice that with pages migrated so far. The margins can be brought back by editing the pages and changing the style from “No spacing” to “Normal”.

From now on, WikiTraccs sets a flag on pages it creates that brings the margins back by default, although they are a bit larger compared to manually created pages. This has technical reasons and will be tackled in a future release.

If you don’t want the margins back, you can activate the old behavior with the following configuration in appsettings.json:

{
  "CustomSettings": {
    "Features": {
      "GlobalRichTextStylingVersion": 1
    }
  }
}

Other Improvements

Confluence On-premises & Cloud

improved: SharePoint pages now have their First Published Date set, as well as the Author; this makes migrated Confluence blog posts appear at the right place in news web parts (previously they would show up as new)
- the First Published Date is set to the Confluence page creation time
- the Publisher is set to the creator of the Confluence page
- the change should also be noticeable in the SharePoint page header, when using a SharePoint page template that shows this information
improved: attachments that are reported as “not found” by Confluence when downloading (although they are listed as page attachment) are now logged as transformation error
fix: handling of attachments where the name only differs by Unicode Normalization Form; WikiTraccs renames the quasi-duplicates because SharePoint doesn’t support that; also, a transformation error will be logged
fix: handling of duplicate attachments; same procedure as above
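
Names that differ only by Unicode Normalization Form look identical but consist of different code points. The sketch below detects such quasi-duplicates by normalizing to NFC and renames the later ones. The `_1`, `_2` suffix scheme is an assumption for illustration and not WikiTraccs' actual naming; the sketch also does not check whether a renamed file collides with another existing name.

```python
import unicodedata
from typing import Dict, List


def rename_quasi_duplicates(file_names: List[str]) -> List[str]:
    """Rename files whose names are equal after NFC normalization."""
    seen: Dict[str, int] = {}
    result: List[str] = []
    for name in file_names:
        key = unicodedata.normalize("NFC", name)
        count = seen.get(key, 0)
        seen[key] = count + 1
        if count == 0:
            result.append(name)
            continue
        # Put the suffix in front of the extension, if there is one.
        stem, dot, extension = name.rpartition(".")
        if dot:
            result.append(f"{stem}_{count}.{extension}")
        else:
            result.append(f"{name}_{count}")
    return result


if __name__ == "__main__":
    composed = "caf\u00e9.txt"     # "é" as a single code point
    decomposed = "cafe\u0301.txt"  # "e" followed by a combining accent
    print(rename_quasi_duplicates([composed, decomposed]))
```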

Confluence Cloud

fix: nested task list items could be missing after transformation since the task list format changed in Confluence Cloud; affected SharePoint pages can be identified by the “Text Transferred Percent” metric being below 100
fix: pages with empty title now migrate properly; the page file name will contain the word “MISSINGTITLEMARKER” where the title usually is located (so far I know of one confirmed case of a page with empty title)
fix: looking up the target space id for a link could return a wrong one if the target space doesn’t exist
fix: a link to a database or whiteboard could prevent page creation
fix: enhanced user profile macro transformation now also recognizes user account IDs

WikiTraccs 1.20.43

By Heinrich Ulbricht | Saturday, October 26, 2024

Improvements to Hard Link Transformation

Support for hard link transformation has been extended. For both Confluence on-prem and Cloud hard links are now properly transformed to SharePoint links when using API Keys to authenticate with Confluence.

Furthermore, support for additional Confluence Cloud hard link variations has been added (solves #117 and #119).

Read the blog post for more information about hard links and soft links: Confluence Link Types Explained.

Other Improvements

Confluence on-prem & Cloud

improved: adjusting log output for common errors to be more sensible with regard to log levels - makes checking the logs easier
improved: adding handling for rare case of Confluence failing to convert a table of contents macro
improved: more log output regarding Entra ID authentication and SharePoint site access to facilitate diagnosing permission-related issues; samples have been added to the troubleshooting section of the Registering WikiTraccs as app in Entra ID blog post
fix: content of expand macro, that is contained in a table, that only contains images, might be missing on transformed page; the “All Images Transferred” metric indicates this

Confluence Cloud

improved: user name resolution for @-mentions via user key (this affects pages that have been migrated from Confluence on-prem to the cloud) (solves #114)
improved: adjusted info emoji for info panel to look better
improved: handling additional hard link variation (edit link to page)
improved: transformed Jira issue lists now use the configured selection of columns and the display name instead of internal name
fix: some pages have no position set, making their order undefined (which would affect WikiPakk); now falling back to manual on-prem order logic (might only affect migrated pages; technical details)
fix: Whiteboard export might fail due to the page properties button having a changed structure
fix: Jira issue lists won’t be transformed if they contain multiple links in a column

WikiTraccs 1.20.31

By Heinrich Ulbricht | Saturday, October 12, 2024

Confluence Cloud - Migrating Whiteboards

This release is capable of migrating Confluence Cloud whiteboards to SharePoint Online.

Whiteboards will be exported as an image, and the SharePoint page will show this image. Whiteboard export is currently opt-in and needs to be enabled in the settings. It is in an experimental state and needs to evolve as the number of whiteboard migrations grows.

Note: Atlassian does not provide an official way for applications like WikiTraccs to export whiteboards automatically and at scale. WikiTraccs thus resorts to browser automation. It will open whiteboards in a browser and automatically “click” the buttons needed to export the whiteboard as image. This method can work well, but is prone to breaking if Atlassian changes the user interface. This will need to be adjusted as needed - until Atlassian provides a proper export endpoint. This ask is tracked in CONFCLOUD-76634.

Note: WikiTraccs cannot yet detect empty whiteboards and those fail to export. This is a known limitation.

Other Improvements

Confluence Cloud and On-prem

improved: image scaling; image sizes now match the source more closely, and scrollbars in tables are prevented
improved: only accounts of page authors were added to the user mapping table, now the last editor is added as well
improved: better handling of more complex link contents
- sometimes you might see this in pages, where WikiTraccs wasn’t able to handle a link that had some “content” in it:
- this should now look like this instead, which helps de-cluttering pages:
improved: converting three-column layouts with empty columns left and right to one-column in SharePoint, to use available space
fix: page tree macro conversion fails if the search box is on in the macro settings
fix: images that link to PSD files were converted to text placeholder; will become attachment links now
fix: SVG image scaling to zero size issue, making images appear missing
fix: file names with control characters could prevent a page from being created (how does “Ctrl+C” end up in a file name?)
fix: special characters in commenters’ names could make page transformation fail
fix: link resolution might fail for ID-only links that miss the target content type
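
Regarding the control characters in file names mentioned in the list above: one way to make such names safe is to drop every character of the Unicode category "Cc" (control). This is only a sketch; whether WikiTraccs removes or replaces these characters is not stated, and the function name is made up.

```python
import unicodedata


def strip_control_chars(file_name: str) -> str:
    """Remove control characters (Unicode category "Cc") from a file name."""
    return "".join(ch for ch in file_name if unicodedata.category(ch) != "Cc")


if __name__ == "__main__":
    # "\x03" is the character typically produced by Ctrl+C.
    broken_name = "report\x03.pdf"
    print(strip_control_chars(broken_name))
```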

Confluence Cloud

new: added transformation for decision lists
fix: modification date and author for new content types (whiteboard, database, etc.) were missing in the last prerelease
fix: emoji conversion - cloud-specific ones are now handled correctly (#27)

WikiTraccs 1.20.15 (Prerelease)

By Heinrich Ulbricht | Saturday, September 28, 2024

General Improvements

new: the Include Page macro is transformed to a link to the corresponding SharePoint page
improved: removing empty entries from transformed Table of Contents macro

Confluence Cloud - Preparing for Migration of new Content Types

This release adds further support for the new Confluence content types like whiteboard, database, folder, and smart link. Those will not be migrated yet. But a new SharePoint page will already be created, although an empty one. Future releases of WikiTraccs will add migration support for those new content types.

Migrating the new content types to empty pages already has one benefit: all metadata required to rebuild the Confluence page tree will be present in SharePoint. If you use WikiPakk to show the Confluence page tree in SharePoint, this will now show the whole tree, regardless of content type.

Proxy Mode Improvements

Note: Those changes only apply if you have the “Proxy Confluence API calls through browser” setting activated.

improvement: automated browser is more resilient against accidental closing; will reopen
fix: relative link issue in proxy mode with cloud site that redirects after logging in
fix: modified JavaScript to handle missing page body (when page loads slowly)
fix: adding CORS error handling
fix: failed external file downloads could cause a lot of retries with way too long timeouts

Updating to .NET 8

Microsoft is deprecating .NET 6 in a couple of weeks, forcing the update to .NET 8. This should make no difference from an end-user point of view.

WikiTraccs 1.20.6 (Prerelease)

By Heinrich Ulbricht | Sunday, September 15, 2024

New transformation capabilities

new: page header template support; here’s how:
1. create a SharePoint page template with name migration-template.aspx (that is the file name, not the page title; the title of the template page doesn’t matter)
2. all newly migrated pages will now get the same header as the template page in the same site
- note: this is a per-site configuration; each site can have their own migration-template.aspx that applies to all pages migrated to the same site; sharing a template across sites is not possible at the moment
new: macro placeholder customization using transformation templates
- macro placeholders can now be defined via Handlebars templates
- that means you could now get rid of the 🚧 placeholder and define your own placeholder text, for each macro
- Documentation: Macro Placeholders and Transformation Templates
new: adding explicit support for language macros from Bitvoodoo’s Translation for Confluence

Lots of small improvements (PART 3)

improved: better handling of certain cases of nested sections, columns, and macros
improved: support for horribly broken macros in pages; WikiTraccs tries some fixing
improved: memory management to better support long-running migrations under low-memory conditions
improved: more diagnostic output for rare “Not a GET query” connection issue
improved: Draw.io embed macro transformation now supports diagrams embedded from other pages
fix: nested aui tab macros screw with the “handled macro percent” metric
fix: “non breaking space” unicode character in page titles breaks attachment links
fix: images with links were not immediately clickable, but required a page edit first
removed the “Use M365 PnP Client ID” shortcut button as Microsoft retired the PnP Entra ID application on September 9, 2024

Confluence Cloud

improved: panel transformation now takes the panel’s emoji and color into account

WikiTraccs 1.19.13

By Heinrich Ulbricht | Thursday, August 15, 2024

Lots of small improvements (PART 2)

Confluence on-prem & Cloud:

new: migrating image captions
new: handling Spreadsheet macro
improved: supporting links on images that start with //
improved: handling images that link to non-image files instead of an actual image (like .msg email file); those will be converted to simple links
improved: handling of attachment thumbnails; thumbnail images will be downloaded and treated as additional attachments of the page they appear in
improved: better error handling and display for connection tests in WikiTraccs.GUI
improved: Selenium proxy mode now only downloads one Confluence attachment at a time to increase resilience
improved: marker for added table cells
improved: handling of even more non-standard tables
fix: issue with generated links when more than one view-file macro is on one page
fix: handling of inaccessible Jira issue lists as anonymous user
fix: Selenium proxy mode could show mixed-content warning when downloading http resource; will upgrade to https and try again
fix: Selenium proxy mode could run into timeouts for larger attachments
fix: lowering the number of IDs per CQL query for content ID selectors even more to 200 to work around Server Error issue
fix: content of non-standard list content in nested lists could be missing (note: this would be indicated by < 100% text transferred metric)
fix: handling of empty table filter macro - it will now be removed
fix: empty table of contents macro needlessly transforms to text placeholder - will now be removed

Confluence Cloud:

improved: link transformation for inline card links

WikiTraccs 1.19.5

By Heinrich Ulbricht | Sunday, August 04, 2024

Note: the previous preview release v1.19.3 introduced a so far unknown HTTP header conflict that made Confluence on-prem fail to convert the page tree and table of contents macros to HTML. This has been fixed in v1.19.5. Please update to the new release when it becomes available.

Lots of small improvements (PART 1)

Confluence on-prem & Cloud:

new: Table of Contents macro migration (as static snapshot)
new: transformation of Google Drive Link macro to regular link
new: setting storage locations (in WikiTraccs settings dialog)
new: setting to disable connection checks for troubleshooting purposes (in WikiTraccs settings dialog)
improved: removed “Add caption here” being displayed for transformed images (in SharePoint)
improved: transformation of .ico image files with certain MIME types
improved: empty code macro transformation result
improved: Confluence user name lookup in anonymous auth mode
improved: adding more image to emoticon conversions (to prevent smilies from being served as image files)
improved: Confluence connection tests (logging, error display, focus on supported TLS versions)
improved: default values when opening the settings dialog the first time (enabling comments migration and downloading of external images by default)
fix: wrong display size for images when EXIF rotation is set
fix: rare case of inline images making code macro transformation fail
fix: lowering the number of IDs per CQL query for content ID selectors from 500 to 300 to work around Server Error issue
fix: page title with line break prevents page creation in SharePoint

Confluence Cloud:

new: Panel extension transformation (#137)

WikiTraccs 1.18.19

By Heinrich Ulbricht | Wednesday, July 31, 2024

improved: removing more blocking user interface elements in automated Chrome browser, like the search engine selection dialog
improved: better duplicate page remediation; when duplicate page IDs are returned by Confluence (rare cases, but it happens), WikiTraccs falls back to “manual” page tree traversal which seems to reliably get all pages for a space
fix: adding option to (hopefully) work around the “DevToolsActivePort file doesn’t exist” error (#136):

WikiTraccs 1.18.15

By Heinrich Ulbricht | Wednesday, July 24, 2024

new: “Embed draw.io Diagram” macro is now supported, migrates just as the “draw.io Diagram” macro
improved: applying Gliffy macro width to SharePoint image
improved: applying draw.io macro width to SharePoint image
fix: page and layer selected in draw.io macros were not respected; the corresponding image will now be shown in SharePoint

Note: this blog post has more information about Gliffy and draw.io: Migrating Gliffy and draw.io macros to SharePoint Online.

WikiTraccs 1.18.13

By Heinrich Ulbricht | Friday, July 19, 2024

Transforming even more colors from Confluence to SharePoint!

This covers text colors, the new text highlight colors, and table cell colors.

WikiTraccs 1.18.11

By Heinrich Ulbricht | Wednesday, July 17, 2024

Note: Please have a look at the last preview and beta releases as well for the complete list of changes.

new: support of US Government High (GCC High) cloud environment is now confirmed; the other environments should work as well (US Government (GCC), US Government DoD (GCC DoD), Germany, China)
- note: .us domains default to GCC; to select GCC High or GCC DoD, use appsettings.json
improved: delta migration input file parsing now recognizes the marker (“x”) even if not delimited by tab, but only by space
improved: toned down the log output for content ID selector
fix: delta migration fails with exception being shown in Console

WikiTraccs 1.18.10 (Beta)

By Heinrich Ulbricht | Thursday, June 20, 2024

This release contains two major new additions: Waves and the Content ID Selector.

Waves

Waves are used to partition the migration into different chunks. Those chunks can then be

migrated one after another
migrated in parallel by multiple WikiTraccs instances

Waves are numbers that you assign to source selectors in the Space Inventory.

When starting a Confluence to SharePoint migration you can tell WikiTraccs which of those waves it should migrate.

This article has all the info: Migration Waves.

Note: WikiTraccs will add a new column WT_Setting_Waves to the Space Inventory list.
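
The idea of partitioning by wave can be sketched in a few lines of Python. This is an illustration only, not WikiTraccs code: the `SourceSelector` type, its fields, and the `select_for_waves` helper are hypothetical names, and the sketch assumes one wave number per Space Inventory entry. The real assignment happens in the WT_Setting_Waves column.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SourceSelector:
    """Hypothetical stand-in for one Space Inventory entry."""
    space_key: str
    wave: int  # the wave number assigned to this entry


def select_for_waves(selectors, waves):
    """Return the entries whose wave number is among the requested waves."""
    wanted = set(waves)
    return [s for s in selectors if s.wave in wanted]


inventory = [
    SourceSelector("HR", 1),
    SourceSelector("DEV", 2),
    SourceSelector("OPS", 2),
    SourceSelector("ARCHIVE", 3),
]

# One instance can migrate wave 1 first and continue with the next wave later ...
first_run = select_for_waves(inventory, [1])
# ... or two instances can run in parallel, each with its own wave.
instance_a = select_for_waves(inventory, [2])
instance_b = select_for_waves(inventory, [3])
```

Each run only sees the entries of its own waves, which is what makes both sequential and parallel migration possible.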

Content ID Selector

So far WikiTraccs has two ways of selecting which content it should migrate: by space key and by CQL query.

This release introduces a new way of selecting content to migrate: by content ID. Give a list of page IDs and blog IDs to WikiTraccs and it’ll migrate exactly those.

The Content ID Selector allows for flexible selection of your source content. You’ll be able to get those IDs from the database (on-premises) or from the space report (cloud), filter as you like, and put the result into WikiTraccs.

This article has all the info: How to migrate Confluence Pages using Content ID Selectors.

Note: WikiTraccs will add a new column WT_Setting_ContentSelectorValue to the Space Inventory list.
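
To give a feeling for how a long list of IDs can be handled, the following Python sketch splits the IDs into batches and builds one CQL `id in (...)` clause per batch. The helper names and the batch size are assumptions made for this illustration; how WikiTraccs batches IDs internally may differ.

```python
def batches(items, size):
    """Yield consecutive slices of at most `size` items."""
    if size < 1:
        raise ValueError("size must be positive")
    for start in range(0, len(items), size):
        yield items[start:start + size]


def cql_for_ids(content_ids, batch_size):
    """Build one CQL query string per batch of content IDs."""
    return [
        "id in ({})".format(", ".join(str(i) for i in batch))
        for batch in batches(list(content_ids), batch_size)
    ]


# Hypothetical page and blog post IDs, two IDs per query.
queries = cql_for_ids([1001, 1002, 1003, 1004, 1005], batch_size=2)
```

Keeping the number of IDs per query small avoids overly long queries, at the cost of more requests.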

Other changes

improved: better handling of Confluence Cloud base URLs that have more than just /wiki in them
improved: Selenium proxy mode status panel (in browser) can now be moved down to access controls beneath it
fix: Selenium proxy mode and the new Confluence connection watcher did not yet work well together
sharepoint.us URLs are now treated as GCC High environment (note: this changed in the next release); furthermore, a setting has to be configured via appsettings.json:
```
{
  "CustomSettings": {
    "CloudEnvironment": "USGovernmentHigh"
  }
}
```
Note: Create a new file appsettings.json, copy above text to this file, and store the file in the folder where the WikiTraccs.GUI.exe is located.
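
If you prefer to script this step, the following Python sketch writes the setting to appsettings.json and keeps other settings that might already be in the file. `write_appsettings` is a hypothetical helper and not part of WikiTraccs; the folder you pass in is the one that contains WikiTraccs.GUI.exe on your machine.

```python
import json
from pathlib import Path


def write_appsettings(folder, cloud_environment):
    """Set CustomSettings.CloudEnvironment in <folder>/appsettings.json.

    Existing settings in the file are kept; the file is created if missing.
    """
    target = Path(folder) / "appsettings.json"
    if target.exists():
        # utf-8-sig also accepts files that were saved with a byte order mark
        settings = json.loads(target.read_text(encoding="utf-8-sig"))
    else:
        settings = {}
    settings.setdefault("CustomSettings", {})["CloudEnvironment"] = cloud_environment
    target.write_text(json.dumps(settings, indent=2), encoding="utf-8")
    return target
```

A call could look like `write_appsettings(r"C:\Tools\WikiTraccs", "USGovernmentHigh")`, where the path is only a placeholder.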

WikiTraccs 1.17.5 (Prerelease)

By Heinrich Ulbricht | Wednesday, June 05, 2024

This release fixes more Confluence Cloud-related issues as part of the WikiTraccs push to the cloud.

Confluence Cloud

fix: attachments macro transformation fails, generates text placeholder (#128)
fix: PDF macro is not converted to file link, but generates text placeholder (#129)
fix: table cell background color is not converted to color emoji anymore (#130)

Other Changes

new: Confluence connection monitor that continuously checks if Confluence is down or if the user session became invalid; migration is paused in those cases
fix: user mentions in transformed attachments macro still link to Confluence

WikiTraccs 1.17.7 (Preview)

By Heinrich Ulbricht | Wednesday, June 05, 2024

This release fixes more Confluence Cloud-related issues as part of the WikiTraccs push to the cloud.

Confluence Cloud

fix: attachments macro transformation fails, generates text placeholder (#128)
fix: PDF macro is not converted to file link, but generates text placeholder (#129)
fix: table cell background color is not converted to color emoji anymore (#130)

Other Changes

new: Confluence connection monitor that continuously checks if Confluence is down or if the user session became invalid; migration is paused in those cases
new: adding (possible) resilience for erroneous Confluence API responses when requesting the list of space pages
improved: general improvements of table-cell-color-to-emoji-mappings
fix: user mentions in transformed attachments macro still kind of link to Confluence
fix: verification mode creates new empty pages when already migrated pages have been renamed

WikiTraccs 1.17.4

By Heinrich Ulbricht | Friday, May 24, 2024

This release contains a lot of under the hood changes for Confluence Cloud API v2 compatibility (see issue #118 for details). This required some major changes with regard to spaces, as those need to be identified by ID and not alone by key.

Further changes:

new: setting to prevent attachment migration; this can be enabled via appsettings.json like so:
```
{
    "CustomSettings": {
        "Features": {
            "SkipAttachmentDownload": true
        }
    }
}
```
- note: setting SkipAttachmentDownload to true pretends that all Confluence pages have no attachments; page transformation will act as if attachments are missing when generating links to attachments and images that are expected to be there
- note: you’ll see the following log message per transformed page if the setting is active: “Skipping attachment download (by simulating empty attachment list) because Features.SkipAttachmentDownload is set to true | WikiTraccs.Console.Registries.ConfluenceContentRegistry”
improvement: speeding up page retrieval by caching already retrieved page contents
improvement: sometimes link transformation fails for linked-to pages that are not retrievable via title (for unknown reason); CQL-based retrieval has been added as workaround
improvement: reenabling inline images in lists as Microsoft fixed the underlying bug in the page editor
fix: space link in Space Inventory is incomplete for personal spaces
fix: page links in transformed attachment macro might be broken after the first link
fix: link transformation fails for links that point to a blog post by title and there are multiple blog posts with this title
fix: issue when downloading external images and the file name is extraordinarily long
fix: hard-link resolver tries to get page info for hard-linked page draft; those are now ignored

Confluence Cloud:

new: resolving Confluence data cards linking to Jira issue lists (interactive auth only, see #123 for details)

WikiTraccs 1.16.2

By Heinrich Ulbricht | Monday, April 08, 2024

new: supporting Brikit Theme Press migration to SharePoint Online modern pages
new: detection of Confluence attachment hard links and translation to proper SharePoint links
improved: transformation of nested ordered lists, where the nesting can be reduced
improved: WikiTraccs.GUI now shows notifications on start-up when appsettings.json or cookies.txt are used to modify authentication behavior
improved: added additional authentication detection for interactive cookie-based auth
fix: configurations via appsettings.json are not applied to WikiTraccs.GUI connection test
fix: the combination of cookies.txt, appsettings.json, and anonymous authentication is broken
fix: selecting pages via CQL query in the Space Inventory is broken
updating dependencies that have security advisories (ImageSharp, Azure.Identity, Microsoft Identity Client)

WikiTraccs 1.14.13

By Heinrich Ulbricht | Wednesday, April 03, 2024

improved: connection check for Confluence now also checks that space information can be retrieved to discover issues faster
improved: logging now includes Confluence version, to faster diagnose version-specific issues
fix: interactive login (cookie based) broken for Confluence Cloud

WikiTraccs 1.14.10

By Heinrich Ulbricht | Friday, March 29, 2024

The LinkTransformationMode introduced in the last release now covers more cases of hard links to pages and spaces. (Hard links are pure “text links” without any Confluence magic.) Note that hard link transformation is (and so far has been) supported for pages and spaces, not attachments. So attachment hard links are always migrated as-is.

WikiTraccs 1.14.6

By Heinrich Ulbricht | Wednesday, March 27, 2024

fix: on-prem Confluence groups are not always added to Confluence User and Groups list

WikiTraccs 1.14.8

By Heinrich Ulbricht | Wednesday, March 27, 2024

New Link Transformation Setting

This release introduces a setting to disable link transformation for unmapped entries in the Space Inventory.

Background: So far, all Confluence links (page links, space links, attachment links) are transformed to SharePoint links. If there are entries in the Space Inventory where the target SharePoint site address is empty, the link will be mapped to the default target site (the address entered in the blue WikiTraccs.GUI window, in step 4).

Now there is a setting to restrict link transformation to only those entries in the Space Inventory that explicitly have a target SharePoint site address set. Other links will point to Confluence.

The setting is available via appsettings.json:

{
    "CustomSettings": {
        "Features": {
            "LinkTransformationMode": "needexplicittargetsite"
        }
    }
}

Note: Hard links to Confluence spaces are not covered yet. This is a known limitation; support will be added in a future release.
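
The decision described above can be sketched as follows. This is a simplified model in Python, not the actual implementation: `resolve_target_site` and its parameters are hypothetical, the inventory is reduced to a mapping from space key to target site address, and a space that is missing from the mapping is treated like an entry with an empty address.

```python
def resolve_target_site(space_key, inventory, default_site, mode=None):
    """Return the SharePoint site a link into `space_key` should point to.

    Returns None if the link should keep pointing to Confluence.
    `inventory` maps space keys to target site addresses ("" if not set).
    """
    explicit = inventory.get(space_key, "")
    if explicit:
        return explicit
    if mode == "needexplicittargetsite":
        return None  # no explicit target site: leave the Confluence link as-is
    return default_site  # previous behavior: fall back to the default target site


inventory = {"HR": "https://contoso.sharepoint.com/sites/hr", "DEV": ""}
default = "https://contoso.sharepoint.com/sites/wiki"

with_default = resolve_target_site("DEV", inventory, default)
kept_in_confluence = resolve_target_site("DEV", inventory, default, mode="needexplicittargetsite")
```

The site addresses are made-up examples. With the setting active, only links into HR would be transformed in this example.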

Furthermore

fix: on-prem Confluence groups are not always added to Confluence User and Groups list (a regression introduced with the previous release 1.14.2)
fix: hard links to Confluence attachments were transformed to page links

WikiTraccs 1.14.2

By Heinrich Ulbricht | Friday, March 22, 2024

More Authentication Methods

Adding Personal Access Token as authentication method for Confluence, as well as Device Login for SharePoint Online.

Using Personal Access Tokens with Confluence makes the browser automation that WikiTraccs has relied on so far obsolete. Personal Access Tokens are available as of Confluence 7.9 and make the authentication experience smoother. We are not dependent on the Chrome browser anymore. This also works with Confluence Cloud (where they are called API tokens).

With Device Login for SharePoint you can authenticate on a different workstation than the one WikiTraccs is running on. There are cases where M365 authentication causes challenges in certain environments and Device Login allows logging in on another workstation.

Performance Improvements

improved: page creation speed
fix: memory leak causing memory consumption to increase when running for days

The page creation logic has been adjusted: while one page is being created, the next page is already being prepared in SharePoint. This can improve page creation speed (the raw page without attachments) by up to about 25%. It is currently being monitored whether this leads to increased throttling by Microsoft, as we are moving faster than before.

Confluence Cloud Compatibility Push

This release contains a major change regarding how users and groups are identified internally. So far, WikiTraccs used user keys and group names to identify users and groups. The cloud abandons the use of those and rather uses IDs everywhere. WikiTraccs can now handle those as well.

You’ll see a change in user- and group-related fields in the User and Group Mapping list, the Page Translations list, and the Site Pages list. Fields that used to contain user keys like abc123def now contain an identifier like user:key:atlassian:|onprem|abc123def (for users) or group:id:atlassian:|cloud|123456789 (for groups). Those identifiers explicitly specify the type, source, and (of course) value, which is vital for WikiTraccs when looking up and working with users and groups.

This change is backward-compatible. Identifiers that aren’t following the new format are assumed to be on-prem user keys or on-prem group names, respectively.
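
To make the identifier format more tangible, here is a small Python sketch that takes such an identifier apart and applies the fallback for old values. It is derived only from the two examples above; the `Principal` type, the `parse_principal` helper, and the "name" ID type used for legacy group names are assumptions for illustration.

```python
import re
from typing import NamedTuple

# <kind>:<id type>:atlassian:|<source>|<value>
_PATTERN = re.compile(
    r"^(?P<kind>user|group):(?P<id_type>[^:|]+):atlassian:"
    r"\|(?P<source>[^|]+)\|(?P<value>.*)$"
)


class Principal(NamedTuple):
    kind: str     # "user" or "group"
    id_type: str  # e.g. "key" or "id"
    source: str   # e.g. "onprem" or "cloud"
    value: str


def parse_principal(text, legacy_kind):
    """Parse a new-style identifier; treat anything else as a legacy on-prem value."""
    match = _PATTERN.match(text)
    if match:
        return Principal(match["kind"], match["id_type"], match["source"], match["value"])
    # Old format: a plain on-prem user key or on-prem group name.
    id_type = "key" if legacy_kind == "user" else "name"
    return Principal(legacy_kind, id_type, "onprem", text)
```

For example, `parse_principal("abc123def", "user")` is interpreted as an on-prem user key, while the new-style values carry their type and source with them.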

This change closes two issues which were open on the road to full Confluence Cloud compatibility:

The following issues are addressed at least partially:

This release also gradually starts using Atlassian’s v2 API in the cloud.

Other Changes

change: SharePoint page comments are not disabled anymore for newly created pages, as this speeds up page creation; if you want to disable comments, please use this PowerShell script from the library: ToggleCommentsForPages.ps1
fix: path configuration via appsettings.json of WikiTraccs.GUI did not carry over to WikiTraccs.Console (attachment registry path, temp folder path)
fix: a current directory that differs from the WikiTraccs.GUI.exe folder could screw things up (appsettings.json not found, logs folder created somewhere else); now the current directory of WikiTraccs.GUI is forced to be the exe location
fix: a vertical tab character in a page’s title could prevent another page that links to that page from being created

WikiTraccs 1.13.1

By Heinrich Ulbricht | Thursday, March 21, 2024

fixed: certain empty tables could prevent pages from being transformed; those are now removed from the page

WikiTraccs 1.13.0

By Heinrich Ulbricht | Tuesday, February 27, 2024

new: downloading of external images (like: not stored in Confluence) can be disabled in the settings; if disabled, those images will be transformed to links

fix: some progress bars went missing in the prerelease; those are back
fix: fixed a rare race condition when choosing migration target sites

⚠️ You should update to this release to minimize risk of running into the race condition issue.

WikiTraccs 1.12.29

By Heinrich Ulbricht | Friday, February 23, 2024

new: adding SharePoint page modification time to “25” progress log file
improved: page migration speed
improvement: working around time zone issues that can affect the “25” progress log file, marking up-to-date pages as changed; differences that can be attributed to time zones being off are now detected and ignored
improvement: adding consistency check to check for large file uploads that were interrupted by Microsoft throttling
improvement: consolidating local file storage locations; those are now documented here in detail: File Storage
fixed: when page restrictions are narrowed down in the hierarchy of pages, dropped users and groups were not removed from the child pages
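
How a time zone mix-up can be told apart from a real change is easiest to show with an example. The Python sketch below is one possible heuristic and not necessarily what WikiTraccs does: it assumes that a difference of (almost) a whole number of quarter hours, up to 14 hours, points to a time zone offset rather than an edit. The function name and the tolerance are made up for this illustration.

```python
from datetime import datetime, timedelta


def differs_only_by_time_zone(a, b, tolerance=timedelta(seconds=2)):
    """True if two timestamps differ by roughly a whole number of quarter hours.

    Identical timestamps return False, as there is no difference to explain.
    """
    delta = abs(a - b)
    if delta <= tolerance or delta > timedelta(hours=14) + tolerance:
        return False
    quarter = timedelta(minutes=15)
    remainder = delta % quarter
    # Distance to the nearest multiple of 15 minutes.
    return min(remainder, quarter - remainder) <= tolerance


recorded = datetime(2024, 2, 23, 12, 0, 0)
shifted = datetime(2024, 2, 23, 13, 0, 1)   # one hour off, plus a second of jitter
edited = datetime(2024, 2, 23, 12, 7, 0)    # a genuine later modification

zone_issue = differs_only_by_time_zone(recorded, shifted)
real_change = not differs_only_by_time_zone(recorded, edited)
```

A heuristic like this can misjudge an edit that happens exactly one hour after the migration, which is the trade-off of ignoring such differences.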

WikiTraccs 1.12.16

By Heinrich Ulbricht | Sunday, February 18, 2024

new: signed WikiTraccs binaries
new: checksum file in release package
improvement: adding additional logging to permission migration to better diagnose edge cases
fix: WikiTraccs could stop scheduling migration targets when accessing one target site fails; scheduling will now continue

WikiTraccs 1.12.6 (Prerelease)

By Heinrich Ulbricht | Thursday, February 15, 2024

Permission Migration Improvements

More cases are covered where WikiTraccs would previously give up, mainly in complex permission hierarchies.

WikiTraccs 1.12.5

By Heinrich Ulbricht | Tuesday, February 13, 2024

Google Endpoint Fix

Time for Google’s once a year change of endpoints. WikiTraccs has been adjusted to handle those.

Permission Migration Improvements

new: in a hierarchy of restricted pages, when the principals of a parent page are not all mapped, the principals that are mapped will be inherited by the child pages (within narrow boundaries)
improvement: hardened permission retrieval against connection losses
improvement: when permissions cannot (fully) be applied to a SharePoint page (e.g. due to missing user mapping) permission inheritance for the SharePoint page will be broken nevertheless
improvement: better indication when being throttled by Microsoft (in the console and log file)
fix: reverted the connection timeout reduction and increased the timeout to 600 seconds, as a shorter timeout interferes with throttled requests

Verification Mode

This release introduces the mode Verify page contents.

When running in verification mode WikiTraccs will go over all migrated SharePoint pages, pull the corresponding original pages from Confluence, and compare contents. This creates updated progress log files.

WikiTraccs will check the following things:

check each code snippet web part for invalid characters in the backing CanvasContent1 list item field
check if the Confluence page contains words that are missing in the SharePoint page

The results will be stored in the “25” progress log file.

In the progress log file, each page entry will get an additional state indicator that is one of the following values:

skippedverify - did not check a page; this is the state for the progress log file that is created when starting the verification run
verified - checked a page and found nothing of interest
worddiff - the Confluence page contains words that are not present in the SharePoint page
invalidcodesnippetchars - a code snippet web part contains invalid characters in the CanvasContent1 field of the page list item
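
The word check can be pictured like this. The Python sketch below is a simplified model that compares plain text only; the helper names are hypothetical, and how WikiTraccs extracts and compares words in detail may differ.

```python
import re


def words(text):
    """Lower-cased set of word tokens in a plain-text string."""
    return set(re.findall(r"\w+", text.lower()))


def verify_words(confluence_text, sharepoint_text):
    """Return the state indicator and the words missing in the SharePoint text."""
    missing = words(confluence_text) - words(sharepoint_text)
    return ("worddiff", missing) if missing else ("verified", missing)


state, missing = verify_words("Deploy the service", "Deploy service")
```

Word order and punctuation are ignored here, so only words that are entirely absent from the SharePoint page lead to worddiff.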

But why?

The reason for the introduction of the verification mode is an incident where pages seemed broken due to invalid characters in the backing field of the page. Note that several weeks had passed since the migration.

The content of a SharePoint page is stored in a list item field called CanvasContent1, in the Site Pages library. It is HTML, which needs to adhere to certain rules. One rule is that special characters like < and > need to be encoded properly. WikiTraccs has done this since its inception in 2022. Nevertheless, something changed this content for some pages with HTML code snippet web parts to contain raw < and >. That made page rendering stop there.

The verification run checks for invalid characters in the backing CanvasContent1 field and highlights the affected pages in the progress log files. An easy solution is to delete and remigrate them.
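
As a minimal illustration of the encoding rule, the Python sketch below encodes a snippet and checks encoded text for raw angle brackets. It only looks at the snippet text itself; the real CanvasContent1 field contains surrounding HTML markup, which this sketch does not parse, and both helper names are made up.

```python
import html


def encode_snippet(code):
    """Encode characters that have a special meaning in HTML (&, <, >)."""
    return html.escape(code, quote=False)


def has_raw_angle_brackets(encoded_snippet):
    """True if text that should be fully encoded still contains raw < or >."""
    return "<" in encoded_snippet or ">" in encoded_snippet


encoded = encode_snippet("<b>a & b</b>")
```

Properly encoded snippet text passes the check, while the same text with raw < and > would be flagged.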

Before recognizing that page content was simply not being shown, we assumed that page content had gone missing, which would be a blocker (and at the same time nearly impossible, since WikiTraccs already verifies the character count of transformed pages at the time of migration). It turned out that all page content was there, it was just not being displayed. But this initial assumption is the reason why the verification run also checks for missing words. Note that updates to either the Confluence or the SharePoint page can also cause this check to be triggered.

In the future, the verification run might be extended with further checks or statistics collection.

Logging Improvements

Since the number of transformation modes is growing, WikiTraccs now shows the mode it is running in:

The log file and console also show the mode:

Possible values are:

MIG - Content migration
PRINCIPALS - Update ‘Created by’ & ‘Modified by’, as well as the dates
VERIFY - Verify page contents
PERM - Permission migration

WikiTraccs.GUI also contains several improvements for its progress bars.

WikiTraccs v1.11.16

By Heinrich Ulbricht | Sunday, February 11, 2024

adjustment: default timeout for connections to SharePoint has been decreased from 100 seconds to 40 seconds - NOTE: this change exacerbated a concurrency issue that shows up when being throttled by Microsoft; it will be solved in the next release
fix: memory leaks that lead to high RAM usage

Please report back if you see RAM usage that increases over a period of multiple hours and reaches amounts of > 10 GB. Note that WikiTraccs needs at least as much RAM as the largest file that will be uploaded to SharePoint.

WikiTraccs v1.11.12 (Prerelease)

By Heinrich Ulbricht | Thursday, February 08, 2024

Jira Integration Improvements

So far, there were two assumptions baked into WikiTraccs: Jira can be reached to resolve issue links and issue tables, and WikiTraccs will wait for 60 seconds for Jira to respond. This is now more flexible.

the maximum wait time for reaching out to Jira can now be configured via the appsettings.json -> WiggleRoom.JiraMaxWaitTimeSec setting; this wait time is applied each time the Jira-related Confluence endpoint is called
set WiggleRoom.JiraMaxWaitTimeSec to -1 to disable reaching out to Jira while migrating pages from Confluence to SharePoint; this is handy when the Jira application link is no longer functional
and a bug has been fixed: issue tables that showed the fixVersion always ran into a timeout when being migrated

WikiTraccs v1.11.9 (Prerelease)

By Heinrich Ulbricht | Wednesday, January 24, 2024

Page Label Migration

Finally, page labels are migrated as well. With this release, WikiTraccs creates a new field Confluence: Labels (WikiTraccs) in the SharePoint Site Pages library. The field is of type Multiple lines of text and contains the labels for each page.

Why not migrate to managed metadata instead of text? Because it complicates things. Migrating as managed metadata would introduce dependencies to the term store, which might pull in additional stakeholders with regard to permissions, administration, and knowledge management. And what if you want to use a choice field instead? Those are decisions specific to each migration effort.

The current migration approach for page labels is to put them all into the Confluence: Labels (WikiTraccs) field, separated by the space character (which cannot occur in a label). All labels have a prefix. Valid prefixes in Confluence are “global”, “team”, and “my”.

So, for a page that has two labels foo and bar, the Site Pages library field Confluence: Labels (WikiTraccs) will have this content: global:foo global:bar.

You can use a PowerShell script to copy those values to another field of type managed metadata. And maybe clean them up a bit in the process.
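
The splitting logic such a script needs is small. Here it is sketched in Python for brevity; `parse_labels` is a hypothetical helper, and writing the result to a managed metadata field is not covered.

```python
def parse_labels(field_value):
    """Split the field content into (prefix, label) pairs."""
    pairs = []
    for token in field_value.split():
        # Everything before the first colon is the prefix, the rest is the label.
        prefix, _, name = token.partition(":")
        pairs.append((prefix, name))
    return pairs


labels = parse_labels("global:foo global:bar")
```

From there you can filter by prefix or normalize label names before copying them to another field.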

Other Changes

improvement: Confluence code macro titles are now migrated as well, becoming code comments in the SharePoint Code Snippet web part

WikiTraccs v1.11.6 (Prerelease)

By Heinrich Ulbricht | Sunday, January 14, 2024

Changes

new: setting to switch off attachment sections on pages (via appsettings.json)
improved: link transformation now creates much shorter HTML for transformed links; this can help with large pages that previously ran into the platform limit of 2 MB per page
improved: cross-site attachment handling
improved: Confluence connection handling for better performance
fix: images linking to pages could lose the link
fix: file access issue in Selenium proxy
fix: attachment file names that are invalid on Windows could prevent pages from being migrated

Image transformation adjustments

TL;DR: Just for information, nothing to do.

Adjacent images are now transformed to a column of centered images in SharePoint. Adjacent image placement kind of worked for the old text web part that was used by SharePoint pages up until fall of 2023. The new text web part that Microsoft introduced cannot display adjacent images anymore, at all. No chance. So the default image transformation now shows the ugly truth and makes those images appear centered, one by one. I really hope that the SharePoint pages story evolves in the future to support adjacent images.

Converting adjacent images to single centered images is the new default behavior.

Note: You can get the old behavior back via appsettings.json:

"CustomSettings": {
  "Features": {
    "UseCkEditorv5Mode": false
  }
}

Using above setting will continue to create pages that display adjacent images until you edit a page for the first time. SharePoint’s page auto-upgrade will then re-arrange the images.

Note: A future release of WikiTraccs will introduce the option to convert adjacent images to image gallery web parts.

WikiTraccs v1.11.1

By Heinrich Ulbricht | Sunday, December 17, 2023

Improvements

new: you can set an optional Confluence authentication address so that the browser opens this address instead of the Confluence base URL
- can be used to enter an SSO opt-out URL to authenticate with an account other than the current Windows user account
improved: error handling when running WikiTraccs under different Windows user accounts at the same time

WikiTraccs v1.10.24 (Preview)

By Heinrich Ulbricht | Monday, December 11, 2023

Adding alternative mode to authenticate with Confluence, to work around authentication issues. See for details: Experimental alternative to obtain cookies.

In addition:

fix: update check detects updates when version is already up-to-date
fix: Jira issue table macro not transformed if macro has no column definition
fix: settings dialog shows leftover characters when switching tabs
improved: relaxing the connection check to succeed for on-prem 401/403 results
improved: adding sanity check for config file to detect invalid source authentication modes

WikiTraccs v1.10.12

By Heinrich Ulbricht | Wednesday, December 06, 2023

Improvements

new: WikiTraccs checks for new releases
- WikiTraccs shows when a new release is available (by checking this very GitHub release page)
- the check is on by default but can be switched off in the settings
improved: the license key check is now more forgiving with regard to formatting errors and additional text in the license.txt file
fixed: code macros that need to be moved out of lists were not labeled and had their order reversed; they are now properly labeled and much easier to identify
fixed: character count metric for pages with Jira issue lists could show wrong value, despite the transformation being successful

WikiTraccs v1.10.9

By Heinrich Ulbricht | Sunday, December 03, 2023

Jira issue table snapshots - part 3

improved: links to the Jira issue list are now also added where Confluence normally omits those links (small number of issues)
improved: handling of issue lists when access is denied

Content transformation improvements

improved: bullet point lists and numbered lists don’t stick together anymore
improved: the yellow star emoji from Confluence is converted to a star instead of a circle
improved: handling of nested lists and adjacent breaks
improved: markers for de-nested tables are more descriptive

WikiTraccs v1.10.8 (Preview)

By Heinrich Ulbricht | Sunday, November 26, 2023

Jira issue table snapshots - part 2

The last preview release of WikiTraccs did not consider all Jira issue tables for transformation. This has been extended.

@-mention issues - part 2

There was a lingering user cache corruption issue that affected user mentions and could be triggered when switching WikiTraccs releases. This would cause user mentions to be displayed as “User {{ID}}” instead of the user name.

I could finally identify and fix the root cause. Thank you Kees for providing me with valuable information. This is self-healing. Broken cache entries will be identified and purged.

Changes to inline images in lists

Microsoft is currently upgrading the technological basis of the text editor web part and breaks certain layouts in the process, but only when you edit pages.

Read about the background here:

With this release WikiTraccs changes the way bullet point lists and numbered lists with images are transformed.

Assume a page that has this content:

1. first point
2. second point with image <image>
3. third point

Now when editing such a page SharePoint’s auto-upgrade kicks in and modifies the content to look like this:

1. first point
<image>
1. second point with image
2. third point

See how the layout and list numbering is screwed up?

Since I don’t expect Microsoft to do anything about this, WikiTraccs now separates the text content and the images, like so:

🚧 Note: Some images were moved beneath the list as SharePoint does not support images in lists
1. first point
2. second point with image 🚧 See image 1.1
3. third point
<image> 🚧 Image 1.1 (moved)

This seemed like the only possible thing to do that doesn’t look too bad. Please let me know if you have better ideas. I just want to emphasize that SharePoint does not allow images to be placed in lists anymore. This is a regression in my book, but maybe it will be possible again at some point in the future.

This change is active as of this release and affects newly migrated pages.
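
The separation described above can be illustrated with a minimal sketch. It is not WikiTraccs' actual implementation: the function name, the `<image>` placeholder token, and the reading of the label "1.1" as "list number, then image counter" are assumptions made for illustration only.

```python
from dataclasses import dataclass, field

IMAGE = "<image>"  # hypothetical token standing in for an inline image


@dataclass
class SeparatedList:
    note: str                 # hint shown above the list (empty if nothing moved)
    items: list[str]          # list items with images replaced by references
    moved_images: list[str] = field(default_factory=list)  # images placed beneath the list


def separate_images(items: list[str], list_index: int = 1) -> SeparatedList:
    """Replace each inline image with a reference and collect the images separately."""
    new_items = []
    moved = []
    for item in items:
        while IMAGE in item:
            # Assumed label scheme: "<list number>.<image counter>"
            label = f"{list_index}.{len(moved) + 1}"
            item = item.replace(IMAGE, f"🚧 See image {label}", 1)
            moved.append(f"{IMAGE} 🚧 Image {label} (moved)")
        new_items.append(item.strip())
    note = (
        "🚧 Note: Some images were moved beneath the list "
        "as SharePoint does not support images in lists"
        if moved
        else ""
    )
    return SeparatedList(note, new_items, moved)


result = separate_images(
    ["first point", "second point with image <image>", "third point"]
)
```

Keeping the images in a separate collection means the list itself contains only text, which is the shape SharePoint's auto-upgrade leaves untouched.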

WikiTraccs v1.10.4 (Preview)

By Heinrich Ulbricht | Thursday, November 23, 2023

Jira issue table snapshots

WikiTraccs now takes a snapshot of Jira issue tables and puts this snapshot into the SharePoint page. The snapshot shows the content you would see when opening the page in Confluence. If the table cannot show all issues of the underlying query, a “show more link” will be added below the table. This follows the logic of Confluence.

Note: So far WikiTraccs already handled links to single Jira issues. Those were converted to hyperlinks pointing to the issue in Jira.

This is a first shot and I’m open to feedback. Does it make sense and where can it be improved? At least it seems better than a plain placeholder text, although the placeholder contained the JQL query. Should the JQL query go back into the pages? Let me know in the discussions.

@-mention issues

I got isolated reports about @-mentions not being transformed to the user’s display name, but to the Confluence user key (like “User {{long-user-key-here}}”). So far I have not been able to reproduce this, but this release contains some caching improvements that might help.

WikiTraccs v1.9.0

By Heinrich Ulbricht | Tuesday, November 07, 2023

Storing the raw Confluence page contents in SharePoint

The raw storage format XML (background) of migrated Confluence pages is now being stored in a SharePoint document library Confluence Content Snapshots (WikiTraccs). You can find this library in the WikiTraccs site, it will be automatically created with your next migration.

The raw page content can help adjusting pages later on, e.g. when new SharePoint web parts become available, and allow the development of client add-ons that conduct additional tasks.

Refer to this sample PowerShell script on how to get the storage format XML for a migrated page: get-confluence-snapshots.

Comments migration

Footer comments can now be migrated and will become content of the migrated page (in case of multiple migrated translations, they will be added to the main page). This is the first approach to tackle #15 (please chime in if you have ideas or feedback).

Notes regarding comments migration:

opt-in; you can enable comment migration in the settings, default is off
for now this covers footer comments, not inline comments
the raw storage format XML for comments is stored together with the page’s XML in the Confluence Content Snapshots (WikiTraccs) library; this also includes inline comments, including resolved ones; you can access those comments there if needed
migrating comments could bring pages to the maximum size of ~2MB (technical limitation of SharePoint); switch comment migration off if you experience errors

Other changes

new: adding a button to quickly jump to the WikiTraccs.GUI config file folder (to back it up or reset it)
improved: the WikiTraccs.GUI settings now properly initialize with the WikiTraccs.Console default migration settings, when being opened for the first time (e.g. blog post migration is on by default)
improved: adding hidden excerpt macros to the default Macro Ignore List
improved: the result of the Show Config as Text option is now more compatible with the appsettings.json format that is used to configure WikiTraccs.Console

WikiTraccs v1.8.6

By Heinrich Ulbricht | Saturday, October 28, 2023

Increased visual feedback and improvements

new: buttons to quickly jump to log and update folders, directly from the blue WikiTraccs.GUI window
improved: more environment checks (log folder writable, WikiTraccs.Console blocked, connectivity for new endpoints)
improved: more visual feedback about what WikiTraccs.GUI is doing (connection check, logins, etc.)
fix: permission migration is prematurely interrupted by auto-shutdown if no content needs to be migrated
fix: progress bars for permission migration did not finish after all pages have been handled

WikiTraccs v1.8.4 (Preview Release)

By Heinrich Ulbricht | Thursday, September 21, 2023

Confluence Query Language (CQL) and Speed

new: select source pages via CQL query - see the updated documentation: Confluence Space Inventory
improved: cross-page linking to spaces, pages, and attachments has some edge-cases removed
improved: page creation and file upload speed increased up to 42%

Enabling CQL queries required changes to the page selection logic. This Preview Release lacks polishing with regard to:

labels/logs reading “space” instead of “CQL selector”
documentation needs to be updated to reflect that CQL selectors are an alternative to whole-space selectors

Actual page content migration should be unaffected. Please report any other issues you encounter.

WikiTraccs v1.7.6

By Heinrich Ulbricht | Tuesday, August 29, 2023

Transformation improvements

improved: code macro transformation maps macro properties to available settings of SharePoint code web part (language, line number toggle, theme)

Furthermore, the license key format has been unified with WikiPakk, enabling future bundling options. Soon, issued WikiTraccs license keys will need this WikiTraccs release or newer.

WikiTraccs v1.7.4

By Heinrich Ulbricht | Friday, August 25, 2023

Transformation improvements

new: WikiTraccs.Console will now auto-close after having migrated all scheduled pages
new: handling auibutton macros, converting them to links
improved: marking expand macro content with horizontal rule elements improves readability
improved: nested expand macros and panels don’t produce (de)nested tables anymore
improved: nested, invisible bullet point lists are now reduced to a single list, improving readability

WikiTraccs v1.7.1 (Update Mode Preview)

By Heinrich Ulbricht | Tuesday, August 22, 2023

New Feature: Update Mode a.k.a. Delta Migrations

With this release it is possible to update previously migrated and now outdated pages in SharePoint. Updating outdated pages might be necessary when users keep editing pages in Confluence that have already been migrated to SharePoint.

You can read everything about this new feature here: Updating previously migrated pages.

WikiTraccs already detects outdated SharePoint pages and logs them to one of its progress log files. With this release you can mark pages you want to update and feed the file right back to WikiTraccs. WikiTraccs will then migrate the marked pages again, overwriting the stale SharePoint pages.

BREAKING CHANGE: This release changes the format of the 25- progress log file. See Monitoring Confluence to SharePoint Page Migration Progress for documentation of the progress log files.

There are some caveats when it comes to renaming already migrated pages in Confluence; these are highlighted in the documentation as well.

Please give it a try and leave feedback whether this is useful for you or not, and which adjustments you would like to see.

WikiTraccs v1.6.35

By Heinrich Ulbricht | Friday, August 11, 2023

WikiTraccs.GUI usability improvements

Based on observations how folks use WikiTraccs.GUI some adjustments were made in the user interface. More stringent top-to-bottom flow, added links to the documentation, and more helpful button states.

Furthermore:

fix: Confluence Cloud connection check may fail, but shouldn’t

WikiTraccs v1.6.32

By Heinrich Ulbricht | Tuesday, August 08, 2023

fix: attachment links to other pages sometimes couldn’t be properly transformed to SharePoint

WikiTraccs v1.6.31

By Heinrich Ulbricht | Sunday, July 30, 2023

fix: Confluence login for Chrome version 115 and up (#73).

WikiTraccs v1.6.29

By Heinrich Ulbricht | Tuesday, July 04, 2023

improvement: hardening against connection errors
- page creation is now retried several times, with increasing delays
- space page retrieval and site preparation is retried as well
improvement: adding logging for throttling; when the app is throttled by Microsoft this is now visible as log message in the console while it previously could sit and wait for minutes without saying anything
improvement: lock busting for site preparation locks; when there was an issue during site preparation the stale lock is now undone after a couple of minutes; this should unblock the most common scenarios where provisioning gets stuck at this point
improvement: creating a new log file with every start of the application; this allows for easier log analysis
improvement: displaying the version number in WikiTraccs.GUI
fix: space processing stops when mapped to an inaccessible SharePoint site; this is now skipped and logged
fix: enabling/disabling of minor library versions when migrating to multiple new sites

Note: When running on an AWS EC2 instance, in a private subnet, behind a NAT gateway WikiTraccs’ connections to SharePoint are terminated frequently. This seems to be a known issue in said environment when apps use connection pooling. This release should allow migrating nevertheless as it adds retry loops for certain operations. The migration time thus will be increased in such an AWS environment.
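
Retrying with increasing delays, as mentioned above, can be sketched as follows. This is a generic illustration, not WikiTraccs' code: the function name, the attempt count, the delay values, and the use of `ConnectionError` as the retryable error are all assumptions.

```python
import time


def retry_with_backoff(operation, max_attempts=5, initial_delay=1.0,
                       factor=2.0, sleep=time.sleep):
    """Call operation(); on a connection error wait and retry with a growing delay."""
    delay = initial_delay
    for attempt in range(1, max_attempts + 1):
        try:
            return operation()
        except ConnectionError:
            if attempt == max_attempts:
                raise  # give up after the last attempt
            sleep(delay)
            delay *= factor  # wait longer before the next attempt
```

Each failed attempt adds its delay to the total run time, which is why migrations in an environment with frequent connection drops take longer.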

WikiTraccs v1.6.11

By Heinrich Ulbricht | Sunday, July 02, 2023

Highlighting formerly merged table cells

Since SharePoint Online does not support merged table cells, WikiTraccs cannot transform tables 1:1 to SharePoint. This release adds markers for table cells that once were merged.

For more details about table transformations and merged table cells refer to this blog post: How to migrate rich Confluence tables to limited SharePoint tables?

❕ Marking merged table cells is the new default. ❕

Note: If you want to revert to the former behavior of not marking merged table cells you can use appsettings.json. This page has the snippet needed to configure this: Sample Configurations

Furthermore:

fixed a bug that prevented Confluence blog posts from being migrated to SharePoint when a blog post had the same title as a page

WikiTraccs v1.6.8

By Heinrich Ulbricht | Thursday, June 22, 2023

So far WikiTraccs first got the list of pages for a space and then started migrating those pages.

Unfortunately, with increasing page count, Confluence gets slower in providing the page list (see #63). This meant so far that you had to wait quite some time until WikiTraccs started migrating a large space.

This has been changed. After retrieving the first pages for a space WikiTraccs already starts migrating those. This should save some time when migrating large spaces (> 15000 pages).

Getting the pages is done in batches, so a list of 15000 pages is not retrieved at once, but in smaller batches. The batch size so far was 100, this has been increased to 200, which seems to be a maximum. Please open a support ticket if this does not work in your environment.
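
The idea of fetching the page list in batches and migrating while the list is still being retrieved can be sketched with a generator. This is an illustration under assumptions: `fetch_batch(start, limit)` is a hypothetical callable standing in for the Confluence request, and only the batch size of 200 comes from the text above.

```python
from typing import Callable, Iterator


def iter_pages(fetch_batch: Callable[[int, int], list],
               batch_size: int = 200) -> Iterator:
    """Yield pages batch by batch so processing can start before the list is complete."""
    start = 0
    while True:
        batch = fetch_batch(start, batch_size)
        yield from batch
        if len(batch) < batch_size:
            return  # a short (or empty) batch means there are no more pages
        start += len(batch)
```

A consumer simply loops over `iter_pages(...)` and handles each page as it arrives; the next batch is only requested once the current one has been consumed.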

Two issues around Confluence Cloud have been resolved as well, namely #28 and #65, which prevented successful login detection.

Note: This is marked as a pre-release for now because the scheduling logic changed quite a bit and will be monitored until the next release.

WikiTraccs v1.6.6

By Heinrich Ulbricht | Sunday, June 18, 2023

Promoting blog posts and adding statistics for hard links

This release has two additions to features added in the last releases.

Blog posts that have been migrated from Confluence to SharePoint are now promoted to news posts in SharePoint, so they appear in news web parts.

Hard links on Confluence pages that were detected by WikiTraccs but could not be transformed to page links in SharePoint now increase the “Inaccessible pages” and “Inaccessible links” counters that are available as fields in the Site Pages library.

WikiTraccs v1.6.4

By Heinrich Ulbricht | Saturday, June 17, 2023

Transforming Confluence “hard links”

Confluence kind of monitors links between pages. When updating the title of a page Confluence takes care of updating incoming links to this page. Those links are called soft links and should be the majority of links. When migrating Confluence pages to SharePoint WikiTraccs takes care of transforming soft links to SharePoint page links.

There is another kind of link - the hard link. Those links are plain old HTML links that Confluence is oblivious of. Technically, they appear like any other text content. And also technically, they lack important metadata that is needed to locate the target page (page ID, space key). Sometimes those links make it into a page. And as long as the target page does not change much this works, until it doesn’t and they break.

With this release WikiTraccs looks out for hard links and tries to figure out the target page. If that is successful the hard link is transformed to a proper page link in SharePoint.

There are some limits to this hard link transformation: it works only for links to pages, not attachments. And it only works for links pointing to the current page version. Hard links to older page versions cannot be converted since older page versions are not present in SharePoint. Also, the link text stays as it is - so a link might look like still pointing to the source Confluence system, yet properly links to the migrated SharePoint page.
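
To illustrate what "figuring out the target page" of a hard link can involve, here is a minimal sketch that extracts a page ID or a space key and title from a URL. It is not WikiTraccs' implementation. The two URL shapes (`/pages/viewpage.action?pageId=...` and `/display/<space key>/<title>`) are assumed here as typical Confluence page addresses, and the function name and host are hypothetical.

```python
from urllib.parse import urlparse, parse_qs, unquote_plus


def parse_hard_link(url: str):
    """Return page metadata guessed from a plain URL, or None if nothing matches."""
    parsed = urlparse(url)
    query = parse_qs(parsed.query)

    # Assumed shape 1: .../pages/viewpage.action?pageId=12345
    if parsed.path.endswith("/pages/viewpage.action") and "pageId" in query:
        return {"page_id": query["pageId"][0]}

    # Assumed shape 2: .../display/SPACEKEY/Page+Title
    segments = [s for s in parsed.path.split("/") if s]
    if "display" in segments:
        i = segments.index("display")
        if len(segments) >= i + 3:
            return {
                "space_key": segments[i + 1],
                "title": unquote_plus(segments[i + 2]),
            }
    return None


target = parse_hard_link(
    "https://confluence.example.com/display/DOCS/Getting+Started"
)
```

A link that matches neither shape yields `None`, which corresponds to a hard link that cannot be resolved and stays as it is.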

This addition should reduce the manual post-migration labor a bit. Feedback is appreciated, either as feature proposal or discussion.

WikiTraccs v1.5.0

By Heinrich Ulbricht | Thursday, June 15, 2023

Migrating blog posts!

So far the focus has been on migrating Confluence pages to SharePoint, with little demand for blog posts. But this is a thing of the past.

WikiTraccs now supports migrating blog posts to SharePoint as well.

Blog post migration can be turned on in the settings dialog that is available in the main menu bar of WikiTraccs.GUI via Settings > Configure Transformation. There, choose Migrate blog posts. WikiTraccs then will pick up blog posts in your next migration.

This setting is turned off by default.

WikiTraccs 1.4.6

By Heinrich Ulbricht | Tuesday, June 13, 2023

Bugfixes and more status info in WikiTraccs.GUI

This release fixes a timezone-related bug that could mark all pages as outdated in the progress log files. Note that this only affects the log files and not page metadata.

Furthermore the progress bars showing space migration progress should now behave and properly reach their end after migrating all Confluence pages of a space to SharePoint.

WikiTraccs.GUI got a small log text field highlighting important information like “X outdated pages were found” or “All done”. This information is also available in the live console window and the log files, but it’s friendlier and quicker to have it right in the GUI.

The behavior of the space migration progress bars has been slightly changed. They will reach 100% even if outdated pages were found and subsequently skipped. But the new log text window will highlight the fact that there were outdated pages. The progress log files then provide a detailed list of those pages.

WikiTraccs 1.4.2

By Heinrich Ulbricht | Thursday, June 08, 2023

Additional Language Migration Mode

This release adds the first additional language migration mode: migrate all languages of a Confluence page to one target SharePoint page.

This is the new default setting.

This is relevant for Confluence pages that have been translated using Scroll Translations. WikiTraccs detects those languages and migrates them all, one after another, into one target SharePoint page.

If you want the old default behavior - migrate only the language with most content - you can switch it back in the settings dialog of WikiTraccs.GUI.

This release also increases the default timeout for Confluence API calls to 5 minutes, up from 30 seconds to better cope with slow responses.

WikiTraccs 1.3.14

By Heinrich Ulbricht | Tuesday, June 06, 2023

More Sanity Checks when using WikiTraccs.GUI

There is a list of nuisances that got smaller with this release:

the Confluence connection test now also checks if the API can be reached; so far this only happened when starting the migration, creating a confusing state where the connection test succeeded, but starting the migration might still fail
the SharePoint connection test now checks that the SharePoint site addresses point to a site root (subsites are not supported)
small formatting fix for the video links introduced in v1.3.13

WikiTraccs 1.3.13

By Heinrich Ulbricht | Sunday, June 04, 2023

Solving very specific Issues

This release solves two issues that cropped up in migrations.

The first issue is #51 that needs support for handling Confluence images that are in fact… videos. WikiTraccs should now detect those and apply the current default handling for videos: adding a link to the page that points to the video file in the attachments.

The second issue is #46 where it surfaced that Confluence can return duplicate page IDs when being asked for a space’s pages. WikiTraccs now de-duplicates the list of pages that it receives from Confluence which hopefully solves this issue (via #52).
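
De-duplicating a list of page IDs while keeping the original order can be sketched in a few lines. The function name and sample IDs are hypothetical and only illustrate the idea.

```python
def deduplicate_page_ids(page_ids):
    """Remove duplicate IDs while keeping the first occurrence of each, in order."""
    # dict keys are unique and preserve insertion order
    return list(dict.fromkeys(page_ids))


unique_ids = deduplicate_page_ids(["101", "102", "101", "103", "102"])
```

Preserving the order matters when pages are scheduled in the sequence in which they were received.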

Note: The WikiTraccs log files were of tremendous help while diagnosing those issues. There’s a page dedicated to those log files, where to find them, and how they help: Troubleshooting Strategies. It’s best to always send them over as described on the page.

WikiTraccs 1.3.8

By Heinrich Ulbricht | Friday, June 02, 2023

This release fixes an issue where preparing a target SharePoint site failed when running WikiTraccs on a non-English system for a site with non-English default language, with a user that has English set as default language.

WikiTraccs 1.3.7

By Heinrich Ulbricht | Monday, May 29, 2023

Reliability and Traceability Improvements

This release improves handling of Confluence user accounts that cannot be found in SharePoint. This sometimes caused WikiTraccs to create an empty placeholder page in SharePoint. This has been tuned down.

WikiTraccs will from now on log the missing account names to the Transaction Log field for affected pages and set the Check Principal Mapping flag. This allows you to update editors and authors later, after configuring a proper account mapping.

Furthermore the logging of handled pages has been extended. The number and list of page IDs WikiTraccs got for each space can now be seen in the console and log. This can be used to diagnose missing pages and whether this might be due to permission configuration, or some other reason.

WikiTraccs 1.2.2

By Heinrich Ulbricht | Saturday, May 20, 2023

Macro Ignore List to exclude macros from the migration

This release of WikiTraccs introduces the Macro Ignore List.

Use the Macro Ignore List to remove certain macros from the Confluence migration to SharePoint. This is useful for macros that are only visible when editing a page in Confluence, but are visible on the SharePoint page after migration.

Read all the details in this blog post: Ignoring macros when migrating pages from Confluence to SharePoint

WikiTraccs 1.1.1

By Heinrich Ulbricht | Friday, May 19, 2023

More Migration Progress Insights

With this release WikiTraccs will generate progress log files giving more detailed insights into the migration progress.

Which pages have been migrated from Confluence to SharePoint? Which pages are left? Which pages need an update (in preparation of the update feature to come…)?

Those questions can now be answered. Have a look at the documentation on where to find the files and how to interpret them: Monitoring Confluence to SharePoint Migration Progress

GitHub Releases

Friday, September 30, 2022

Releases are available for download on GitHub.