Smoking Heaps of Machinery

A local, friendly and composable utility belt for investigators.

Download for macOS
(coming soon)

Disclaimer

Smokingheaps is in its early stages of development. It is likely that things may not work as expected. Please refer to the FAQ for more information. Additionally, feel free to reach out for any further details. Contact information can be found below.
Smokingheaps summarizes text.

local

Most tooling requires complex infrastructure and operational support. Smokingheaps is designed to run on your local machine first and respects the privacy and security of your data.

friendly

The world of data investigation is already complex enough. Smokingheaps prioritizes simplicity and puts investigators at the center of attention.

composable

Combine various utilities and create data workflows that are greater than the sum of their parts. Rearrange workflows to fit your investigation.

Smokingheaps is a collection of small utilities that help to discover small and medium sized datasets.

As journalists, we have a multitude of tools explicitly designed for handling and processing data and documents at our disposal. Despite their overall usefulness, a common challenge with most of these tools is that they involve intricate, often technically demanding deployments. This requirement typically implies a need for costly server infrastructure and operation. Further compounding this, these tools are constructed focusing on specific scale datasets. As such, they are difficult when applied to flexible, ad-hoc tasks or less formal collaborations.

Instead of relying on remote servers that require internet access and data transmission, Smokingheaps is designed to work predominantly local first. This design choice provides the dual advantage of immediate responsiveness and enhanced data security. As a desktop application, Smokingheaps functions on your personal computer and guarantees no data is ever transferred offsite to a remote server. This underpins a more secure operation, crucial for journalistic work where data protection can be paramount.

In an experimental direction, Smokingheaps is exploring the integration of machine learning algorithms in natural language processing (NLP) to enhance its capabilities further. It will aid journalists in everyday tasks but might also be helpful to excavate hidden trends or patterns. However, the real breakthrough lies in running these machine learning models directly on the user's computer — maintaining the core ethos of data security and privacy.

Features

  • Made with investigators in mind.
  • A graphical desktop application that runs on your computer.
  • Private and secure.
  • Natural language processing such as summarization or translation of text.
  • Free and open source.

Frequently asked questions

Many of the utilities work really slow. How can I improve performance?

Machine learning tasks can be improved by using the graphics chip instead of the regular processor. The GPU/Neural Engine in modern Apple hardware (M1 based Macs) can be utilized for this purpose. However, the current machine learning library lacks complete support for it. Efforts are being made to fully support M1 based Mac hardware.

To enable extended machine learning support, follow these steps:

  1. Launch a terminal.
  2. Set the CPU as a fallback when the graphics chip is not yet supported.
  3. Launch Smokingheaps.

Open a terminal and enter the following two commands:

export PYTORCH_ENABLE_MPS_FALLBACK=1
/Applications/Smokingheaps.app/Contents/MacOS/Smokingheaps
Does Smokingheaps work on Linux or Windows?

Currently, only macOS is supported by Smokingheaps. However, I plan to offer an installer for Linux and Windows in the future. Please note that these platforms have not been tested yet. If you're feeling adventurous, you can visit the code repository and attempt to build a version for your platform on your own.

Contact and Support

General questions

Please don't hesitate to get in touch with me if you have any inquiries, feedback, or suggestions. You can reach me via email at christo.buschek@proton.me. Thank you for your interest in Smokingheaps. I look forward to hearing from you!

Contact me

Bug reports

If you encounter any issues or bugs with Smokingheaps, or want to suggest a new feature, please let me know by filing an issue on GitHub. Your report helps me improve Smokingheaps. Thanks for helping me make Smokingheaps better!

Report an issue