💞 FairScan > Syncthing > Paperlees-ngx
from Teppichbrand@feddit.org to selfhosted@lemmy.world on 06 Feb 06:53
https://feddit.org/post/25409295

It’s perfect! Do you guys already do this?

I open a letter, I take it’s picture with FairScan. The FairScan-folder on my android device gets syncthinged to the ingest folder for Paperless-ngx on my “server”. Paperless imports it, deletes the file and sets the new documents tag to inbox. I decide if the document goes to the binder for important stuff, or if I just toss it in a binder with all the paper I most likely will never touch again. Next time I look at Paperless, I edit all documents with the inbox tag and remove the tag.

#selfhosted

threaded - newest

ZonenRanslite@feddit.org on 06 Feb 07:02 next collapse

I have configured an email address in paperless_ngx and send everything there. The appendix is then tagged as unsorted.

diegantobass@lemmy.world on 06 Feb 07:04 next collapse

This is the transhumanism I like.

Jokes aside, how did you deal with your papers from the past? I do this with all new incoming mail, but I can’t for the life of me find the time to scan the pile of taxes, bills, medical reports, etc. that has accumulated over 3 thousand years of family life.

HelloRoot@lemy.lol on 06 Feb 08:16 next collapse

You could buy an automatic scanner that takes a stack of docs and dumps the files to a network share.

Diplomjodler3@lemmy.world on 06 Feb 09:09 collapse

Do you have any particular model in mind? What do you mean by “automatic scanner”? Any scanner I know needs quite a lot of manual preparation to scan a batch of documents.

HelloRoot@lemy.lol on 06 Feb 09:40 collapse

Epson WorkForce DS‑730N

put 100 sheets on the tray, it scans them all and either puts them all into a single pdf or multiple pdfs. Then you split / merge them in software.

Diplomjodler3@lemmy.world on 06 Feb 09:43 collapse

Not very automatic, is it? There are definitely better ways to scan high volumes of documents.

HelloRoot@lemy.lol on 06 Feb 10:14 collapse

please elaborate

Diplomjodler3@lemmy.world on 06 Feb 10:49 collapse

You can use e.g. barcodes, patch codes or separator sheets (which usually carry the patch code). Sometimes you can also separate documents by recognising some feature on the first page, e.g. a logo or a barcode that’s already there. And of course it’s a good idea to put single page documents in a separate batch so you just separate them by page count. This of course also works if all documents are two or three pages long.

HelloRoot@lemy.lol on 06 Feb 11:36 collapse

afaik you just listed features that the printer I mentioned (or if I am wrong, other similar printers) supports

it’s my bad for not mentioning all possible workflows, I was just a bit lazy and thinking of my personal documents only, which do not work well with further smart automation, because my batches are highly irregular. So the more manual approach is the best for me currently. Maybe possible with some future AI integration.

Diplomjodler3@lemmy.world on 06 Feb 12:29 collapse

Most of those features are implemented in the scan software on the PC, not on the scanner itself. Although there is a tendency to integrate more and more features in the firmware, which is not always a good idea. Also, if you’re scanning low volumes, I’d say doing the separation before the scan is generally more efficient. At least that’s how I do it. But that’s just me, of course. I wasn’t in any way trying to criticize your approach. If it works for you, it’s great.

Teppichbrand@feddit.org on 06 Feb 09:04 collapse

I took my Very Important Documents!!-folder to a neighbour with a decent scanner. It’s not that much, we scanned for an hour or two. Older, less important stuff stays in binders I most likely will never touch. If I do have to look at something from there, I snap a quick FairScan before I put it back. So it’s not about perfection, I just try to make my live easier from now on. :)

silmarine@discuss.tchncs.de on 06 Feb 07:15 next collapse

Wait, this is genius. When paperless has processed it and moves the original, it’s also removed from your phone, right?

Feddinat0r@feddit.org on 06 Feb 07:39 collapse

Yes, paperless deletes it from the import folder which is synced with syncthing, so on android the folder is also empty

avidamoeba@lemmy.ca on 06 Feb 07:27 next collapse

Welp I guess there’s another piece of software I have to setup now… 😄

tofu@lemmy.nocturnal.garden on 06 Feb 07:37 next collapse

I’m using CamScanner, but I have to share every doc to paperless. Should have a look at an automatic ingest as well…

quaff@lemmy.ca on 06 Feb 07:37 next collapse

This is awesome! Thank you for sharing this flow! 😍

Feddinat0r@feddit.org on 06 Feb 07:40 next collapse

I use an hardware scanner which then puts the pdf via ftp on the inbox folder of paperless. So i can also all old stuff

illusionist@lemmy.zip on 06 Feb 07:45 next collapse

Why don’t you use the mobile app? Mine works just fine

At least I think it does. Maybe all the docs that I upload are lost. Who knows 😅

Teppichbrand@feddit.org on 06 Feb 09:12 next collapse

Haha, thanks. I didn’t know this existed! I already use Syncthing, so that’s no problem. And I like that FairScan rotates my picture and adjusts it’s angle. If the Paperless Uploader-app does this as well, I might use it. Taking a look right now!

illusionist@lemmy.zip on 06 Feb 09:31 collapse

I use f-droid.org/…/de.astubenbord.paperless_mobile

Teppichbrand@feddit.org on 06 Feb 10:54 next collapse

Does it detect the four corners and adjusts the angle to make it flat 2D?

utjebe@reddthat.com on 06 Feb 11:50 collapse

It does, sometimes… but it is not a huge PITA.

I need to see if FairScan would do a better job.

Damage@feddit.it on 06 Feb 13:27 collapse

Last updated 2024/01/26… Hmmm…

edit: checked the wiki, here are the apps for Android

dieTasse@feddit.org on 06 Feb 19:09 collapse

Exactly what I came here to say, paperless app already have a scanner no need to fiddle with file sync and stuff 😊

Opisek@piefed.blahaj.zone on 06 Feb 08:00 next collapse

I do something similar but with the ADF scanner on my Brother printer. The scans automatically go to my server which processes them: deskew, combine with previous scan if it has the same amount of pages (because it might be me scanning the front pages first then the back pages), compress. After that, it’s put on my NAS. I left the step of importing it to paperless manual, in that I have to copy or move the file from the Scan directory on my NAS to the Paperless ingest directory. This is so that I can first check if the scan came out okay.

cron@feddit.org on 06 Feb 09:00 next collapse

Similar approach here:

  • Lexmark scanner witg ADF
  • Scan to FTP (SftpGO)
  • Paperless has the FTP folder as ingest

It doesn’t take more than 10 seconds to scan a doc this way.

JustEnoughDucks@feddit.nl on 06 Feb 11:50 collapse

I have yet to use a consumer ADF scanner on a printer that didn’t feed the paper at an angle until they are crushed and folded, doesn’t matter if the guides are perfectly set for A4 either. It has never worked for me.

llii@discuss.tchncs.de on 06 Feb 08:01 next collapse

No, I use a printer/scanner combo for scanning my files. I trigger a shell script via HomeAssistant which starts the scanner, cleans and fixes the order of the pages for two sided documents, and then puts them into the paperless folder.

But your idea is great!

RazzleDazzle@discuss.tchncs.de on 06 Feb 09:29 collapse

would you mind to elaborate more on the script? Also, what output format do you use from the scanner?

llii@discuss.tchncs.de on 06 Feb 10:01 collapse

Its a hacked together shell script. I wanted to learn shell scripting, but I should’ve probably programmed it in python.

But it basically does this:

  1. scan pages as png via scanimage in gray scale
  2. convert the image into a high contrast b/w one with image magick
  3. sort pages if I have to combine two scans (front and back side)
  4. create a single pdf of all the document pages via image magick
  5. move pdf to the paperless folder

I won’t share it because it’s really ugly and some of the functionality can be done with paperless native, like combining the pages of two scans. Which would’ve been easier.

RazzleDazzle@discuss.tchncs.de on 06 Feb 17:10 collapse

Thanks! And that’s ok, I’m more interested in the steps than the code itself. Do you apply any optimizations with image magik in step 2?

IlllIlllIlllIlllIllIIIIll@sh.itjust.works on 06 Feb 08:03 next collapse

Honestly, phone scanning is a total trap for anything you actually care about. I tried this exact setup for months and the OCR quality was absolute trash because of bad lighting or a slight tilt. If a document is important enough to keep, it is worth getting a dedicated scanner with an auto-feeder. Taking photos of letters feels like a massive chore that you will eventually just stop doing after the novelty wears off.

And let’s be real about that inbox tag. That is just a digital pile of shame. I have hundreds of documents in my own Paperless instance tagged for review that I havent touched in two years. You think you are staying organized, but you are really just moving the clutter from your desk to a database. Syncthing is the only part of this workflow that I actually trust to work every single time without failing.

Korkki@lemmy.ml on 06 Feb 08:10 next collapse

Isn’t syncthing for mobile discontinued?

zarlin@lemmy.dbzer0.com on 06 Feb 08:22 collapse

This is the currently maintained android version: github.com/researchxxl/syncthing-android

Damage@feddit.it on 06 Feb 13:05 collapse

wasn’t there some recent controversy over the change of ownership for the repo?

Cyber@feddit.uk on 06 Feb 17:33 collapse

That was all resolved.

I held off switching until very recently, but reading the githubs, etc. it’s all settled down now.

From memory it was just a bit of a nieve handover (ie “hidden” in the background, not out in public)

Nibodhika@lemmy.world on 06 Feb 08:35 next collapse

That’s a very cool idea, seems great for receipts and quick stuff.

B0rax@feddit.org on 06 Feb 08:50 next collapse

There are companion apps for paperless (like Paperparrot for iOS) that simplify that process even more.

No need for syncthing.

modeh@piefed.social on 06 Feb 12:23 next collapse

Or “Swift Paperless” on iOS too, at least that’s what I use

UndulyUnruly@lemmy.world on 06 Feb 21:40 collapse

Paperparrot is flawless. The dev is also very responsive and helpful.

They also offer a similarly flawless app called Plappa as frontend for your audiobookserver.

Running both, super happy.

JensSpahnpasta@feddit.org on 06 Feb 09:36 next collapse

The best thing about this is that it enables you to stop organizing all this paper. I have a little box next to my scanner and everything that is scanned is just getting put on top of the pile. In most cases you never need the original document anyways. But if you need it, you can check Paperless when it was scanned and you will find that document from Feb 2023 in a few seconds.

MonkeMischief@lemmy.today on 06 Feb 10:21 next collapse

Heads up if your Paperless is installed via Docker: be VERY CAREFUL about your database version and do an export often!

Mine has been down for a while because I did a pull and it doesn’t support my version of Postgresql anymore. So it’s kinda a huge mess trying to figure out how to safely migrate it in the container.

I haven’t been able to fix it yet so I’ve just left it disabled and gone without for a while. It’s not fun.

Allegedly if you export from within Paperless, you can just start with a fresh updated database container and import when this happens. Oof.

Damage@feddit.it on 06 Feb 11:12 next collapse

your version of postgres meaning that you use a db external to the docker container?

prenatal_confusion@feddit.org on 06 Feb 11:21 collapse

No sadly the upgrade path is a bit ducked. Their compose, their versions, still not working without intervention.

Damage@feddit.it on 06 Feb 11:42 next collapse

Well, crap

MonkeMischief@lemmy.today on 06 Feb 15:00 collapse

Thank you, yes, exactly what I meant. It’s a bit baffling such a mature project doesn’t have some kind of migration script. :(

aquovie@lemmy.cafe on 06 Feb 13:36 next collapse

I don’t think Postgres shouldn’t be just left as :latest or anything. At least the way I handle it, DB upgrades require manual intervention.

avidamoeba@lemmy.ca on 06 Feb 19:40 collapse

I just got burned by accidental latest tag on a pg container for Nextcloud. They moved some paths internally and it could no longer find the db.

avidamoeba@lemmy.ca on 06 Feb 17:31 collapse

This is a reminder for self-hosters to put their apps (and their data) on snapshotting filesystems with automatic, regular snapahots turned on; and fix the app versions to at least the major version, across all containers. This should bring similar disruption to bare minimum and makes recovery always possible, without relying on specific app backup features.

IanTwenty@piefed.social on 06 Feb 10:52 next collapse

Thank you for introducing me to FairScan! Great app. I have a scanner but being able to snap stuff on the go is so much quicker.

airikr@lemmy.ml on 06 Feb 12:02 next collapse

Purely awesome app if you scan a straight paper that are not cut in any way. I miss the ability to manually crop the images. Will still use the app, though.

Thanks for recommending it 🙂 Have always used a flatbed scanner for my papers (receipts, bills, etc.), but now I can scan on the go 😊

eodur@piefed.social on 06 Feb 12:20 next collapse

I haven’t used FairScan (yet!), how does it compare to OSS Document Scanner? Other than having a much much better name, obviously.

Teppichbrand@feddit.org on 06 Feb 15:03 collapse

I tried OSS Document Scanner a year ago but never really used it. It has way more features but I didn’t need any of them. With FairScan, you just take pictures, give them a name and save them either as a single PDF or a couple PNGs. Repeat if necessary.

nathan@lemmy.permisuan.com on 06 Feb 13:22 next collapse

Oh man I just set this up. I just changed to a high deductible health plan with an HSA and need to save all of my medical receipts. I already had paperless set up but i was dreading setting up some automated way of getting files over to paperless. Thank you so much!

SecurityPro@lemmy.ml on 06 Feb 13:26 next collapse

I keep having trouble getting paperless to actually injest any scans from the injest folder.

non_burglar@lemmy.world on 06 Feb 13:30 next collapse

Sounds like a great workflow!

Unfortunately, I just can’t get syncthing to run in the background of my phone without chewing up the battery.

avidamoeba@lemmy.ca on 06 Feb 17:28 collapse

You can get it to run at time intervals. E.g. once an hour for 5 minutes. That’s not bad on battery for me. I actually have mine once every 24 hours for 30 minutes so it can successfully transfer a few gigs of Signal backups.

non_burglar@lemmy.world on 06 Feb 18:27 collapse

I will look into this, thank you!

zr0@lemmy.dbzer0.com on 06 Feb 13:48 next collapse

I luckily have a professional document scanner for this, which has wifi and is able to store the PDF on a cloud drive. Started using ASN numbers recently, so I put a sticker on the letter before scanning. The letter gets archived and is found quickly if needed because of the ASN.

Yesterday I wanted to introduce syncthing as my main backup solution. However, I am struggling with setting up the discovery server and want to fix that before installing it on all my devices.

Gortus@lemmy.world on 06 Feb 18:42 collapse

I use a different app (though I’ll be checking out FairScan), but that’s pretty much exactly my workflow for things that aren’t convenient to do basically the same thing on my desktop PC & scanner.