Recommendations please: Self-hosted web site analytics
from perishthethought@lemm.ee to selfhosted@lemmy.world on 22 Apr 00:48
https://lemm.ee/post/30047791

Hello y’all! I have my personal (static) website / blog running on netlify out on the public internet. Netlify, in case you’re not familiar, is not a traditional web host, so I can’t add databases or anything else like that on the server itself. Right now, that site has zero analytics / visitor tracking and I’ve decided I want to fix that. I want to know how many people visited my site and which pages they looked at. I am NOT looking to monetize anything though, to be clear.

I want to self-host that analytics service at home, on my home server, but I need two things, please:

  1. Recommendations for which app to use. I’ve checked out Umami and Plausible and they both look good for my meager purposes. But please - let me know which app makes sense for a personal web site with low-ish traffic. Is there something simpler I could do?

  2. Help getting the reverse proxy set up so my public web site can send analytics data into my home server. I would prefer this to be entirely under my control, so no CloudFlare or Tailscale, for instance. Is Caddy an option? I get really confused really quickly about this level of networking, to be clear, so maybe I just need a really plain-English guide to handling this sort of thing?

Thanks for any / all ideas! Y’all so totally rock!

ETA: A little more info about Netlify and why I can’t install or use tools other traditional web hosts might offer.

** SECOND EDIT**: Thanks to @andrew@radiation.party for the goatcounter suggestion, I am trying that out now for the analytics side of this. Getting it set up was easy and free, using their server. (I know, I know…) If I still like the app after the next couple of weeks, I will move it in-house and self-host. That gives me a couple of weeks to figure out my second issue above, how to have my public web site make requests to my self-hosted, behind the firewall/NAT service. Yay, more learning!

#selfhosted

threaded - newest

BaalInvoker@lemmy.eco.br on 22 Apr 00:54 next collapse

Im using Matomo, however my use case is different from yours. I run a business and need more complete data, which Matomo gives me

I also had in mind Plausible, cause it’s less resource demandant, but at the end I have chosen Matomo

perishthethought@lemm.ee on 22 Apr 00:59 collapse

Gotcha - thanks! Yeah, I saw Motomo listed on the selfh.st apps list but right, it seems more than I need.

selfh.st/apps/

markstos@lemmy.world on 22 Apr 10:57 collapse

I also host Matomo. It was easy to install and has been easy to maintain.

knobbysideup@sh.itjust.works on 22 Apr 01:20 next collapse

You can ship to Graylog with netcat or filebeat. Then you can do all of your graphing, searching, and analysis there.

perishthethought@lemm.ee on 22 Apr 01:51 collapse

Hey thanks, but I don’t have anything to ship to graylog. I was thinking I would get a solution which records those logs for me.

bobs_monkey@lemm.ee on 22 Apr 01:43 next collapse

Off topic, but I wonder if those old visitor counters from the web 1.0 days still work

vvv@programming.dev on 22 Apr 02:44 next collapse

yep. they’re still here. they got smaller, and we call them “tracking pixels” now.

it’s just an image, which, server side, you can count the number of times it got loaded. easy to embed and no js required.

bobs_monkey@lemm.ee on 22 Apr 03:04 next collapse

Ah I’m not talking about modern tracking pixels, but that actual html (js?) code from yesteryear

perishthethought@lemm.ee on 22 Apr 03:43 collapse

Thanks for this idea - the counting would be easy, but I don’t have anywhere to store the numbers. With my free Netlify account and my static web site, I don’t have a DB and and can’t write to a file when someone visits, at least not that I know of.

taladar@sh.itjust.works on 22 Apr 07:34 collapse

If you don’t have a DB or the ability to store data in general none of the open source analytics software will work either.

perishthethought@lemm.ee on 22 Apr 03:42 collapse

Sure but I can’t have a HTML or JS file write to any file on my web server directly, so I can’t store a number anywhere in Netlify to increment a counter.

mbirth@lemmy.mbirth.uk on 22 Apr 01:48 next collapse

My geeky web hoster provides AWStats and GoAccess. Both work by analysing Apache logfiles, so no JavaScript needs to be injected to the pages. Should be more than sufficient to get easy page tracking. (And also catches those visitors that have JavaScript disabled or tracking stuff blocked.)

perishthethought@lemm.ee on 22 Apr 03:41 collapse

My web provider doesn’t offer anything like that or what they do offer, they charge quite a lot for.

mbirth@lemmy.mbirth.uk on 22 Apr 08:06 collapse

Does your provider not give you access to the webserver log files?

TacoEvent@lemmy.zip on 22 Apr 03:14 next collapse

I spun up Plausible for my company site. Pretty straight forward honestly. It’s completely dockerized so there’s not a lot going on.

Your biggest problem is exposing the service on your home server to the internet. I personally wouldn’t recommend it, but if you know what you’re doing it’s possible.

perishthethought@lemm.ee on 22 Apr 03:40 collapse

Yes, thanks. That’s why I added my #2 above. I don’t know what I’m doing, haha.

TacoEvent@lemmy.zip on 22 Apr 05:41 collapse

Gotcha. All I can add is a vote for Plausible. It’s great!

jtp@lemmy.ml on 22 Apr 07:48 next collapse

I used Matomo for a while for a few of my non-monetized sites, and like you, I found it to be just a little too much for small projects. I switched over to Shynet a few years ago - basic, easy to understand, and self-hostable. I noticed it’s a lot more accurate than Matomo was, probably because it’s on fewer adblock lists. I wouldn’t be bothered to see my browser making requests to an instance either since it’s pretty respectful of privacy and doesn’t collect much.

joneskind@lemmy.world on 22 Apr 07:54 next collapse

Hello,

If you only want to track the number of page views you can do something simpler.

On your website, add an empty image with a known name in the footer that point to your home server (no proxy needed if your website allows loading images from external site)

You should see the attempt to download the empty image in your home server logs. There are tools to parse your server logs for specific keys (I have Logstash in mind)

If you want to track anything else, just add another empty image linked to the website event you want to track.

anguo@lemmy.ca on 22 Apr 11:28 next collapse

I’m running Umami on Railway (so not self-hosted), for two small websites. Works pretty well. I think Railway changed their pricing, but I’ve been grandfathered in with a free plan.

Edit: all of my websites are also on Netlify.

perishthethought@lemm.ee on 22 Apr 13:57 collapse

Thank you for this. I will definitely check out Railway.

ETA: I checked out Railway and their $5 a month plan is still more than I can spend rn. But I will keep them in mind for future uses.

anguo@lemmy.ca on 22 Apr 14:23 collapse

If you do go this route, the best way is to make a fork of the main Umami github repository, then link that to railway. When you want to update, you can just sync new changes to the repo, and railway will rebuild your instance.

andrew@radiation.party on 22 Apr 14:41 collapse

Umami has been pretty good to me. Plausible was a close choice but I ran into technical difficulties getting it going.

I didn’t get around to trying it, but goatcounter looked promising as well.

perishthethought@lemm.ee on 23 Apr 01:57 collapse

I’m checking out goatcounter now and so far, it looks great. Thanks for this suggestion!