Here's a Java program that transcribes any thread of this forum into a single file for offline reading! It processes about 260 pages a minute. Time might vary depending on connection speed and site traffic, as well as your preferences (check the menu).
The program can also be told to keep only the posts of certain users, which is great for reading only the GM's posts or finding the Toad's posts in the Future of the Fortress thread.
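For anyone curious what the user filter boils down to, here's a rough sketch. It is not the program's actual code: `Post`, `AuthorFilter` and `keepAuthors` are made-up names, and it assumes author names are matched exactly (case-sensitive).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

/** Hypothetical post model; the real program's classes may look different. */
final class Post {
    final String author;
    final String body;

    Post(String author, String body) {
        this.author = author;
        this.body = body;
    }
}

final class AuthorFilter {
    /** Returns only the posts whose author is in the wanted list, keeping thread order. */
    static List<Post> keepAuthors(List<Post> posts, String... wanted) {
        Set<String> names = new HashSet<>(Arrays.asList(wanted));
        List<Post> kept = new ArrayList<>();
        for (Post p : posts) {
            if (names.contains(p.author)) {
                kept.add(p);
            }
        }
        return kept;
    }
}
```

Calling `AuthorFilter.keepAuthors(thread, "GM")` on a parsed thread would leave only the GM's posts, in their original order.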
The program can combine multiple threads into one, arranging the posts chronologically.
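Combining threads chronologically is basically "pool all the posts, then sort by timestamp". A minimal sketch of that idea, assuming each post already carries a parsed timestamp; `TimedPost` and `ThreadMerger` are illustrative names, not the program's real classes.

```java
import java.time.LocalDateTime;
import java.util.ArrayList;
import java.util.List;

/** Hypothetical post with a parsed timestamp. */
final class TimedPost {
    final String thread;
    final LocalDateTime postedAt;
    final String body;

    TimedPost(String thread, LocalDateTime postedAt, String body) {
        this.thread = thread;
        this.postedAt = postedAt;
        this.body = body;
    }
}

final class ThreadMerger {
    /** Pools the posts of all threads and orders them from oldest to newest. */
    static List<TimedPost> merge(List<List<TimedPost>> threads) {
        List<TimedPost> all = new ArrayList<>();
        for (List<TimedPost> thread : threads) {
            all.addAll(thread);
        }
        // List.sort is stable, so posts with identical timestamps keep their input order.
        all.sort((a, b) -> a.postedAt.compareTo(b.postedAt));
        return all;
    }
}
```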
If the output file ends in .html, the program will split the output into multiple pages based on user preferences (check the menu).
If the output file ends in .txt, the program will keep the output as a single plain text file, great for reading in an e-book or for processing with regular expressions or programs like grep.
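To illustrate how the extension decides the output mode, here's a small sketch. Everything in it is an assumption for illustration: the `OutputPlanner` name, the "posts per page" setting, and the `thread_1.html` style of page naming are not taken from the actual program.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

final class OutputPlanner {
    /** True when the output name asks for paginated HTML rather than one plain text file. */
    static boolean isHtml(String fileName) {
        return fileName.toLowerCase(Locale.ROOT).endsWith(".html");
    }

    /** Splits items into consecutive pages of at most perPage entries. */
    static <T> List<List<T>> paginate(List<T> items, int perPage) {
        if (perPage <= 0) {
            throw new IllegalArgumentException("perPage must be positive");
        }
        List<List<T>> pages = new ArrayList<>();
        for (int i = 0; i < items.size(); i += perPage) {
            int end = Math.min(i + perPage, items.size());
            pages.add(new ArrayList<>(items.subList(i, end)));
        }
        return pages;
    }

    /** Builds a page file name such as thread_2.html (the naming scheme is an assumption). */
    static String pageName(String fileName, int pageNumber) {
        int dot = fileName.lastIndexOf('.');
        if (dot < 0) {
            return fileName + "_" + pageNumber;
        }
        return fileName.substring(0, dot) + "_" + pageNumber + fileName.substring(dot);
    }
}
```

With a .txt name, `isHtml` returns false and everything would go into the one file; with a .html name, the posts would go through `paginate` and each page would get its own file.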
Here's the program:
B12_PostProcessor.jar (10 MB)
Should work on most common OSs (Windows, Linux, Mac) and architectures (x86, x86_64) with an up-to-date Java VM installed.
Bugfixing
Rewrite it to make the code more manageable.
Make it understand and store more info.
Make it download multiple pages at a time to save time, since processing a single page doesn't take long. It's establishing a connection and downloading that are the real time sink. (Done)
Make it able to log in. (Done)
Make it able to download images. (Done)
Make it download theme information and images so that things don't look so black and white when offline. (Done; needs some bugfixing but nothing serious)
Long term:
Make it combine multiple threads into one. (Done; needs to understand dates other than %a %d-%m-%Y, %H:%M:%S)
Make it prettier.
Option to divide output to multiple files. (Done)
Filter options: Remove OOC (()), Only include posts with certain Text types (Italics, Bold, Underlined, Coloured, etc), Include posts with character speech "", mix and match rules
Output options:
minimize post size,
plain text output (Done; maybe add a few more options and clean up output)
Very Long term:
Make it able to work on other forums.
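About the "download multiple pages at a time" item in the list above: since the waiting is mostly network time, a fixed pool of worker threads is one common way to overlap the downloads. The sketch below shows that general technique, not the program's real downloader. `ParallelFetcher` and the `fetchPage` callback are hypothetical; the callback stands in for whatever actually downloads page number `i`.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.function.IntFunction;

final class ParallelFetcher {
    /** Fetches pages 0..pageCount-1 on a pool of workers and returns them in page order. */
    static List<String> fetchAll(int pageCount, int workers, IntFunction<String> fetchPage) {
        ExecutorService pool = Executors.newFixedThreadPool(workers);
        try {
            // Queue every page up front so idle workers always have something to download.
            List<Future<String>> pending = new ArrayList<>();
            for (int page = 0; page < pageCount; page++) {
                final int index = page;
                Callable<String> task = () -> fetchPage.apply(index);
                pending.add(pool.submit(task));
            }
            // Collect in submission order, so the result stays sorted by page number.
            List<String> pages = new ArrayList<>();
            for (Future<String> result : pending) {
                pages.add(result.get());
            }
            return pages;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("Interrupted while downloading", e);
        } catch (ExecutionException e) {
            throw new IllegalStateException("A page download failed", e.getCause());
        } finally {
            pool.shutdown();
        }
    }
}
```

The pages finish downloading in whatever order the network allows, but because the futures are read back in submission order, the output still comes out in thread order.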
Here's the source code if anyone wants to mess with it. (30 MB, Eclipse project)
Changelog:
1/4/2014: New version can combine multiple files into one based on time. Unfortunately, it can only process times in the %a %d-%m-%Y, %H:%M:%S format, so make sure to log in and change your date format to that if you want to use it until I get around to fixing it.
28/3/2014: Ability to split output to multiple files. New options menu. Various bugfixes.
2/11/2013: New version can save output as a "lightweight" .txt file.
1/11/2013: Fixed an out of memory error that occurred in 32-bit Windows JVMs when processing more than 1069 pages.
23/10/2013: New version can log in and can download images and forum theme images. It also utilizes multiple downloader threads to reduce download time. Finally, it has a better GUI.
11/10/2013: New version should work on most common OSs (Windows, Linux, Mac) and architectures (x86, x86_64) with an up-to-date Java VM installed.
10/10/2013: Made the program create a window that acts as a terminal. This means that you can now just double click the file instead of having to launch it from the terminal. Bad thing is, it only works on 64-bit Linux now. I'll fix that tomorrow. Should be an easy fix (famous last words).
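For reference, the %a %d-%m-%Y, %H:%M:%S format from the 1/4/2014 entry maps onto the Java pattern `EEE dd-MM-yyyy, HH:mm:ss`. Here's a small sketch of parsing such a timestamp with `java.time`. `ForumDates` is a made-up name, and the sketch assumes the forum prints weekday names in English.

```java
import java.time.LocalDateTime;
import java.time.format.DateTimeFormatter;
import java.util.Locale;

final class ForumDates {
    // Java pattern equivalent of the strftime format %a %d-%m-%Y, %H:%M:%S.
    // Locale.ENGLISH is an assumption: it expects weekday names like "Tue".
    static final DateTimeFormatter FORMAT =
            DateTimeFormatter.ofPattern("EEE dd-MM-yyyy, HH:mm:ss", Locale.ENGLISH);

    /** Parses a timestamp such as "Tue 01-04-2014, 13:45:10". */
    static LocalDateTime parse(String text) {
        return LocalDateTime.parse(text, FORMAT);
    }
}
```

A timestamp in any other layout makes `parse` throw a `DateTimeParseException`, which is the kind of failure you'd expect when the forum date format hasn't been switched.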