Migrating Big, Multilingual Websites From Static HTML: ICANN.org Case Study

Do you need to move a huge amount of inconsistent, legacy HTML files and associated documents into Drupal? Is the content in 14 different languages? We've done it, and we can show you how recent improvements to the fantastic Migrate module can process your old site with ease. This technique is not only useful for Migrations, but also for moving any static content into Drupal at any stage of a site's lifetime.

  • How to architect a Migration with the Migrate module.
  • Parsing static HTML using QueryPath, so that you can pull out the important information.
  • Mapping data into fields.
  • Dealing with encoding issues.
  • Importing multilingual content and linking it into Drupal translation sets.
  • Moving document files into nodes.
  • Cleaning up content.
  • Strategies for transitioning an organization.
  • Case study of ICANN.org

Mark Theunissen draws on his previous experience as the Lead Developer of Economist.com, and current position as Systems Architect and Developer on the ICANN.org migration to Drupal.

This session will give you the skills you need to confidently tackle that big migration. It's aimed at intermediate developers and architects that would like to move their legacy information to Drupal.

Speakers

Room: 
Track: 
Coding and development
Experience level: 
Intermediate
Questions answered by this session: 
How did ICANN.org migrate thousands of static files into Drupal?
What techniques are there for extracting my data from the HTML?
How have others succeeded in doing this?
How can I migrate my multilingual content?
What kind of disruption will this have on the day-to-day running of my site?
Colorado mountains