1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
|
#+date: <2024-04-06 Sat 00:00:00>
#+title: Convert OneNote to Markdown or Org-Mode
#+description:
#+slug: convert-onenote-to-markdown
If you're looking to convert your OneNote content to another format, such as
Markdown or Org-Mode, you're in luck. I use a solution that doesn't require
other programs, such as Evernote or Notion. Personally, I used this solution on
a managed corporate laptop that doesn't allow installation of other programs
like these.
This solution uses OneNote and Pandoc on Windows 10.
* Export OneNote Content to Word
To start, export any pages or tabs from OneNote to the Word format (=.docx=):
1. Open OneNote desktop.
2. Select =File= and then =Export=.
3. Select the scope of content to export, such as =Tab= or =Page=.
4. Name and save the file in an easy to remember location. I recommend your
Downloads or Desktop folder.
See below for a visual walkthrough of the export process.
* Download Pandoc
Start by downloading Pandoc from their [[https://github.com/jgm/pandoc/releases][GitHub releases]] page. I cannot install
=.msi= files on my corporate laptop, so I downloaded the
=pandoc-3.1.12.3-windows-x86_64.zip= file, which contains a simple =.exe= file
that you do not need to install - you will simply run it from the command line
below.
Once downloaded, unzip the archive and move the =pandoc.exe= file to the same
folder where your Word documents were saved above. If you prefer, you can move
this file to an easier location, such as =C:\Users\youruser\Downloads=.
* Convert Word to Markdown
In this example, I will be converting the Word documents to Markdown, but Pandoc
supports [[https://github.com/jgm/pandoc?tab=readme-ov-file#the-universal-markup-converter][a ton of different formats for conversion]]. Choose the format you prefer
and then modify the following commands as needed.
To perform the conversion, open the Command Prompt. If you can't find it, open
the start menu and search for it.
Within the command prompt, navigate to the directory where you stored the
=pandoc.exe= file and the Word documents.
#+begin_src ps1
cd "C:\Users\yourusername\Downloads"
#+end_src
You can verify that you're in the correct directory with the =dir=
command.
#+begin_src ps1
dir
#+end_src
Once you have verified that you have the command prompt open in the correct
directory with the =pandoc.exe= and the Word documents, you can run the
following loop to convert all Word documents to Markdown.
#+begin_src ps1
for %f in (*.docx) do (pandoc.exe --extract-media=. --wrap=preserve "%f" -o "%f.md")
#+end_src
This loop will perform the following actions:
1. Find all documents matching the pattern =*.docx=, which means all Word
documents ending with that file extension.
2. Iterate through all files found in step 1.
3. For each file, perform the pandoc command.
4. Within the pandoc command, =--extract-media= saves all media found in the
files to the current folder, with pandoc automatically creating a =media=
subfolder to hold all images.
5. Within the pandoc command, =--wrap=preserve= will attempt to prseerve the
wrapping from the source document.
6. Within the pandoc command, the final step is to specify the output path with
=-o=. This option adds the =.md= file extension to recognize the output files
as Markdown files.
If you want to export to another format, simply specify the =-f==/=--from== and
=-t==/=--to== options.
For example, you can convert the Word document to org-mode. You can also convert
to one format and subsequently convert to other formats as needed.
#+begin_src ps1
pandoc.exe -f docx -t org file.docx
#+end_src
|