Category Archives: hi tech

Future proof websites?

Warning: what follows, besides discussing 9/11, is also kind of a nerdy/geeky/technical discussion about how web pages link to each other and an idea for how to make the links between pages, especially pages that may disappear some day, work better. Maybe.

Today is Patriot Day, a “national day of service and remembrance”. Because it’s also the 14th anniversary of 9/11, I ended up reviewing my collection of 9/11 “stuff” – something I started on 9/11 and continued collecting for a few days after those events. It helped to process things a little bit, I think.

Recently, Dave Winer has been discussing, among other things, the “future-safety” of the internet:

The concern is that the record we’re creating is fragile and ephemeral, so that to historians of the future, the period of innovation where we moved our intellectual presence from physical to electronic media will be a blank spot, with almost none of it persisting.

While reviewing my collection, I realized a possible reason his piece has been percolating in the back of my mind was, in fact, this same collection. Why? Take a look – the images I’m hosting myself, since it’s not a big deal (bandwidth wise or effort wise). The links to other sites? That’s where it falls apart.

Some of the sites are still there – and one or two of the links I had still work. That’s awesome – someone thought ahead, or took the time, when they re-did the website, to make sure that the old content was still accessible.

Other links go to sites that still work, but the “layout” of the website – their URL’s and or URI’s (Uniform Resource Identifier’s) – have changed and no-one took the time to make sure it was still accessible easily. For a couple of those, I was able to find the article on the site at it’s new address, so I updated that.

Then there are two other cases left to deal with: the website is gone, or the link that I have uses a URL click-tracker service that is no more. In the case of the website being gone, I can try to use the Internet Archive (or “Wayback Machine”) to try to find the article and then figure out what to do – I could link to the archive’s version, but I decided to take that snapshot and copy it to my own server – I can’t necessarily rely on the archive to be there forever, can I? Maybe, maybe not.

In the case of the URL tracker, well, that’s going to mean some work. I can try to see if the article is available by title, but my search just now for “World reacts to calamity” returned lots of results, but none of them seem to be on the C|Net website – which is apparently either where I got the link in the first place, or where the article was hosted. That’s not helpful at all.

So what can be done? For starters, encourage the discussion. I went to Winer’s site and posted a comment:

I’ve been mulling this all over, and then realized why today. On 9/11/01, I was collecting links of things relevant to what was going on, but I only had links to the pages. I went back to my collection today, and a lot of the stuff is gone – possibly forever? I went and used the internet archive where I could for some things just now, but a lot of the content seems to be lost – especially due to click-tracking links used at the time. If only I knew then what I know now…..

Dave was quick to reply:

Yes. Today is a very good day to be thinking about that. I should write a blog post. Thanks for pointing this out.

And then he wrote a quick little piece about it: A good day to think about web history. And he has the EXACT same problem: links from that day on his own site just aren’t working.

I’ve tried to sound the alarms. Every day we lose more of the history of the web. Every day is an opportunity to act to make sure we don’t lose more of it. And we should be putting systems into place to be more sure we don’t lose future history.

There’s a solution in there somewhere, that’s for sure. For one thing, you have google, which indexes every page ever if it’s allowed to. But that’s only part of the equation – finding the data. But how? What are we going to look for? And, more importantly, where are we going to look? If a server goes offline, that data is gone unless it’s in the archive (which isn’t fee) or someone decides to mirror it (also not free). But how to make it easy to find? Some content, when you’re searching by title for example, you might find multiple sites similarly titled articles – then you have to sort the wheat from the chaff.

Is there a better way? Maybe. Off the top of my head, we need to do a little more on the backend. But what?

Mark the pages somehow with a UUID (Universally Unique Identifier). For example, it could be an SHA1 hash of data from the page – maybe the hostname as the first part, then the time, date, and article title:

agerstein.net
Future proof websites?
2015-09-11
18:04

That gets turned into: d820eab50a74ad6c0c08566b210454848a573dcf-29b6082b508b593c8de53988ef3d2b14b327664b. What do we do with that? Ideally, it’s auto generated and then put into the META data of this web page. Then, when you link to my page, the browser pulls that out of the META data (if it’s available) and adds that to the link – so instead of:
<a href="http://agerstein.net/2015/09/11/future-proof-websites/">Future Proof Websites?</a>

you get:
<a href="http://agerstein.net/2015/09/11/future-proof-websites/" webprint="d820eab50a74ad6c0c08566b210454848a573dcf-29b6082b508b593c8de53988ef3d2b14b327664b">Future Proof Websites?</a>

If you copy/paste the link for an email, or to put on Facebook/Twitter/your blog/whatever, it copies that “webprint” into the link – and if the content goes down for some reason – maybe I die and my website goes away – then a search for the webprint would make it easier to find cached/mirrored copies of the data, since the ID would theoretically go along in the cache/mirror as part of the META data in the pages.

Clearly we would need to use something better/longer than what I have here, since it’s only 81 characters long. That seems like a lot, but we’re in the process of running out of IPv4 addresses and moving everything over to IPv6 – and we didn’t think we’d run out of IPv4 addresses for quite some time back when I got into the computer game.

By having the hostname be the first part of the hash, we reduce the odds of a clash – you could, theoretically have the same second hash as another site, but what are the odds that they would have the same first hash? Impossible unless the site stole your name somehow.

All of this is moot, however, without some longevity built into the hosting. One of Winer’s bigger concern is that sites like Facebook/Twitter/etc seem to have different rules about what counts as a “post” – Tweets don’t have titles, nor do Facebook status updates. But could they? Should they? Things like this mean it’s not as easy to just move your data from one hosting solution to another. You can pull your content from Twitter, but you can’t exactly upload it to Facebook and have it work. You can pull your data from Facebook, but there seems to be so much info available to you – like what advertisements you’ve clicked – that I think you might suffer from over load trying to figure out what to move.

I agree that there should be a standard for this data – and you, as an author/content provider/social media user – should be able to take the data from one service to another. And it should be easy – like just download from one service, suspend your account there, then upload to another and keep going, deleting your prior account when you feel comfortable. But that’s not how things are set up right now. Silos, it would seem are another part of the problem. But there’s a way around that – host it yourself. But then we get to the rub there: what if you die? What if the web server dies? How do you perpetuate your online self after pass on?

Discuss.

Why is it so hard to upgrade???

I’m trying to upgrade my cell phone. I’ve been to 2 stores, both who have specials listed on their website, only to be turned away both times.

iphone5cI have an iPhone 4, and would like to upgrade to an iPhone 5c. I’m not super picky about the color, but I’m avoiding pink and blue, since that’s what my daughter and wife have. They “upgraded” about 5 months before me for Christmas (and because they were eligible) – I’m just late to the party.

Both BestBuy and Walmart list, on their websites, that if you’re willing to commit to a 2 year contract (I am) and want an iPhone 5c in 32gb (I do), you can get one for $49. Neither store is making it easy.

Yesterday was BestBuy #300, in Orange, CT. The website listed a deal: Agree to a 2 year contract and get the iPhone 5c 32gb model for $49. Considering the 16gb model is $99, it’s a great deal – so why not? The website says you have to go to the store to buy it, so I went, expecting it to take a while. It did.

First they tried to set me up with white – as I said, I’m not super picky at this point. The one he started to upgrade me to, however, was the 16gb model. The guy went in back twice to try to find what I wanted, and couldn’t. He then started looking online and couldn’t find anything.

After 45 minutes, I left to go finish running my errands. When I got home, I started hitting their website, trying to figure out if I could find a store with this phone in stock. The closest I could find was near Bar Harbor, Maine – 8 hours from here. From Trenton, NJ, to Rochester, NY and up the coast all the way to Maine. Ridiculous, and not that helpful.

I called their 800 number and lodged a complaint – if you’re going to push it on the website, fine, but put in a note that there’s limited availability or something. Shockingly, the page is no longer available today.

Todays trip was to Walmart, where they have the same deal on their website. In store, the wait was much shorter, and I ended up with a slightly different problem: they have the 32gb, but it’s only in white, and it’s $149. And they don’t have it. The staff member suggested that maybe it’s an online only deal, or I have to order it and have it shipped to the store. I just got home and confirmed: the only way to buy it is in the store.

I just called another store in this area, and they said that they don’t even carry the 5c in 32 gb – so it would seem that the website is just plain wrong.

A quick trip to the Verizon website says that I get to pay $99 for the same phone if I want to get it due to the “Mothers Day” Sale they have now – so I guess it means that instead of saving myself some money, I get to give more to Verizon. And let’s not forget their $30 “upgrade” fee for activating a new phone on an existing line of service.

In the end, I just got myself the green iPhone 5c for $99 before taxes and fees. I’m also going to recycle my old phone, which will apparently cover the cost of the new phone – so I’m only out the $30 upgrade fee and taxes.

Remember, there’s three certainties in life: death, taxes, and apparently upgrade fees.

Cell phones: my brief history

I was digging through my desk at work when I found an old cell phone, complete with charger. I have no idea where it came from (well, 2006, but whatever) nor why I have it. Well, I know what I’m going to do with it: Abby’s going to be getting a phone at Christmas, and this might fix the bill for her very modest needs: making phone calls and sending text messages.

This got me to thinking – almost everything the carriers offer these days are “smart” phones. We have a family plan with 3 iPhones and one regular phone on it. I think that moving forward, the carriers want to do everything they can to make that an all smart phone bill, since they get more money…

What it also made me do is think about the phones that I’ve had along the way. I humbly present a history of my phones, along with some comments on each.

motorola-t720 My first cell phone, on the Cingular network. Cingular had roll over minutes, and was a pretty decent carrier. The phone was pretty cool – I had a camera with me all the time, so that was fun. This was the start of my frustrations, however, with provider settings on cell phones. I also learned the T9 keyboard, and can still type pretty fast using that.


motorola-krzr-k1m We switched to Verizon, and this was my choice. It had an MP3 player, so I got to experiment with making MP3’s that fit on a memory card and would play in the phone. It was a stable phone, a bit of a work horse. It also managed to stand up to more than a few drops and not so nice encounters with gravity.


motorola-rival-ofc By far, this was my favorite cell phone. It was a “bar” phone – genius name, right? They keyboard slide out, but you could still make/take calls with it when it was closed. I think I could even tap out text messages using the T9 keyboard. The problem with this phone was that the OS was Verizon-ified. I had freezes, and the phone would constantly tell me the battery was dying, and at one point, the screen inverted all the colors AND reversed itself – so I was looking in a negative through a mirror. Despite all this, I liked the phone.


Pantech Jest This was the phone I switched to to get away from the Rival – I wanted to try a new manufacturer. Too bad I got stuck with the Verizon OS again. It was a good phone too – slide out keyboard, so you can use it for calls and stuff, or go all crazy with the typing. I got this phone literally the day we left to drive to Disney in 2008, so when I wasn’t driving, I was trying to figure out how to use it. This is also, probably, the start of when I stopped trying to use a different camera to take photos of things.

They say that the best camera is the one you have with you… as the phones have gotten better, so have the cameras. Now that I have the iPhone, I take most of my photos on that – it doesn’t help that the little digital SLR we have always complains that the battery needs to be charged. But that’s a story for another day…

GPS tagged photos: should you be panicking?

fb-link-to-kyeosI recently saw a post on several friends FaceBook pages, all going to the same website, with the same headline: “WARNING!!!! If you take photos with your cell phone“. Clicking the link brings you to a website with a warning about the dangers of posting photos from your cell phone on social media sites. The article has since been removed, but it’s available in Google’s cache if you really want to read it. It’s more of a dire warning to watch a video from the NBC affiliate KHSB in Kansas City, MO and spread the word.

My issue is that, while it’s true that the photos on your phone do include the data they are mentioning, it’s easy for ANYONE to find that information (it’s not limited to “hackers”), and most social websites (Facebook and Twitter at least) remove that data when the photos are shared.

Here, for example, is a photo I took today when getting off the highway. There’s three copies: the one I emailed myself, the one I posted to FaceBook and the one I posted to Twitter.
photo-exif photo-facebook photo-twitter

If you take a minute to save the first one to your hard drive and open it with a program that can read the Exif (Exchangeable Image File Format) data, you can find my location (latitude and longitude) when I took the photo. On the Mac, just open the images in Preview, which comes with all Macs – if you have a PC, you can do a Google search to find something to read it. In Preview, do Command + I (or go to the “Tools” menu and choose “Show Inspector”) and the info window will appear. Select the second tab, then the GPS tab in the second row and you’ll get something like this:
photo-exif-exif

I then copied and pasted the Latitude and Longitude:
Latitude: 41° 15′ 18″ N
Longitude: 73° 0′ 0″ W

You could trim that down and copy/paste it into Google Maps: 41° 15′ 18″ N,73° 0′ 0″ W. You’ll end up with a pin approximately where you were when you took the photo. I was getting off the highway.
exif-GPS

What about Facebook or Twitter? Here’s the info that the same image, uploaded to their servers, presents when saved locally:
photo-facebook-exifphoto-twitter-exif

In other words, when the photos were uploaded, they removed the GPS data.

Is this a perfect system? Not really. If you uploaded images before they started to remove the GPS data, it’s possible that the data is still there. For example, when I first heard about this, in February 2012, I did some searches to see what I could find.

I ended up writing an email to a specific “friend” I met on Twitter, and who happened to have some photos that had GPS data embedded:

First of all, I’m _not_ a stalker, simply a fan.

I was sent an email today, ostensibly for parents, but really, just for anyone who should be thinking before posting things online: YouTube link.
(the gist: if you take photos on your smart phone, when you post them online, the geo-tagged info might still be in your photos).

I first went to Facebook to see if some photos that had recently been posted by people I knew had geotag info – nothing. I think that Facebook actually strips out the EXIF data, which I guess is good.

Next I went to twitter and started looking for photos posted there by people I follow, and struck out again.

I expanded my search again, and found, after looking at a photo you Instagrammed, that one of the photos you posted on your twit pic contained the data [link provided but removed].

To get the data, I saved the file to my desktop, opened it in Preview on the Mac, then brought up the Inspector (under the tools menu).

It shows:
Latitude: 40° xx’ xx” N
Longitude: 73° xx’ xx” W

There’s also a handy “Locate” button, which opens a browser to: here

Which includes a street address:
621-699 W 40th St
Manhattan, NY 10018

Again, I’m _NOT_ stalking you. I just need to be clear on this. OK? You just happened to be the first person that I found who was sharing photos that had the geotag data in it.

Anyway, I went to one of the websites mentioned in the YouTube video and it turns out that it’s well known that TwitPic doesn’t scrub the data:
http://icanstalku.com/

And instructions on how to disable it, so that even if the website doesn’t scrub the data, it won’t be there:
http://icanstalku.com/how.php

Anyway, just thought you should be aware – I’ll be sending out some similar letters to friends and family.

Adam
(again, really not a stalker)

I ended up never sending it – it felt too stalkerly – but the video I linked to then is the same video!

There are ways to turn the GPS data embedding off if you want, just do a web search for “(your phone type) and disable gps tagging”.

I still check my uploaded photos every month or so, just to make sure – better safe than sorry!

Silence!

I made this ringtone for iPhones (and anything else that supports Apples .m4a or .m4r MPEG-4 audio formats) so that I have a “final option” when someone annoys me. How does it work? Set their contact to use a custom ringtone of silence and you won’t know when they are calling. I’ve been planning to do it for a while, but now I can.

The plan is this:
* Butt dial me too many times? Silence!
* Over use your “friend” status and call me on my personal phone for work related things? Silence!
* Irritate me for the 10th time in an hour to ask me something I already told you? Silence!

Fun for the whole family.


Silence