blog.humaneguitarist.org

discoveries in digital audio, music notation, and information encoding

Archive for the ‘opinion’ Category

on the brain: audio + ocr/hocr, “did you mean”, and “there are no ebooks”

leave a comment

Lightning talk style 'cause I'm home sick and need to get a few things out there …

Audio + OCR/HOCR

Some time ago I wrote this post on OCR/HOCR and making searchable pages. I recently did some tests with generating audio with Festival and using simple HTML5 audio to ad audio to the page. I only used Festival on the OCR output, but by using the HOCR output it's no big thing to make audio for every line that Tesseract "detects" and incorporate it with SAVS or something.

"Did You Mean?"

Google doesn't seem to offer a "did you mean" API, but you can get around it. Other options might be to use Wikipedia's API or Google's own search suggestion API (i.e. first suggestion). In both links to the API, I've sent it "disese" instead of "disease".

There are no eBooks

In digital, why the hell are we still thinking of "eBOOKS" and "digital AUDIO BOOKS", etc.?

Why can't we just think of them as web applications? And instead of having "ebook reader software" and "audio book software" why can't we just use, say, a Python/PyQT based application that uses Webkit? That's to say, I get that for monetary reasons not everything can be open, but why can't I just download a compiled script that runs Webkit and disallows me from doing things like viewing the source, saving the page, etc.? If the files need to be downloaded they can be saved in password secured ZIP files (which can only be downloaded, say, with a username/password). The Webkit app would be the only thing that could talk to a centralized DB and determine if the user still has rights to view the material, if so the DB could hand the application the password to read the contents into memory from the ZIP file, show them through the browser, etc. without making the contents readable otherwise.

What am I missing here? A lot, I'm sure. But there's got to be a better way and we need to stop thinking of digital as a bits/bytes rendering of the physical world. After all, is this an eBook or a digital audio book? I can, of course, both read and/or hear it.

In early tests, making "eBooks" readable via a Python/Webkit app is working, HTML5 audio is working on my computer but not at work, video on neither. But I *think* based on what I've read that there are some bugs with the PyQT Webkit, so maybe it's just a temp thing.

Either way, why keep inventing new software when a secure browser lets us read, listen, and watch? Adding stuff like bookmarks is a simple matter I'd think of storing data in a centralized DB for that given user account (or even a local SQLite db).

Moreover, will there even be "e-readers" and "mp3 players" in the near future? Won't it all just be a "device" of a given size that just runs a web browser?

--------------

Related Content:

Written by nitin

March 7th, 2013 at 12:11 pm

we’re all professionals … I’m not

leave a comment

There's often a direct correlation between classification and prejudice.

… I recently dropped in on a going away party for a friend/colleague of mine so naturally most people who were there work in libraries … or at least were married to such creatures.

One person asked me about my background, etc. and asked if my current job was my first "professional position". I responded along the lines of "I guess this would be considered so …" because in reality I don't like this pseudo-distinction that people in libraries try to perpetuate.

A few weeks ago, a "para-professional" asked me if I was a "librarian" and I said something like "in as much as I work in libraries … like you."

Isn't the real question here: "Do you have a Master's degree in library stuff?". If so, ask it. If not, please stop perpetuating a falseness based on the abundance of insecurity and arrogance of many of your peers.

I'm not asked to work harder than previous jobs. I wasn't allowed to screw up less in past jobs.

We're all professionals …

… but I'm not.

:P

    Written by nitin

    December 15th, 2012 at 10:43 am

    Posted in opinion

    Tagged with ,

    what I won’t do is troubleshoot you

    leave a comment

    From time to time people actually read this blog.

    From time to time, people actually comment on something I've written.

    From time to time, people sometimes even use software I've provided …

    … and from time to time, sometimes those people have questions, comments, or suggestions for improvements.

    I like to think I'm all ears, all the time.

    But sometimes, while asking for my "help", it becomes painfully obvious that some people simply want me to do something for them because they won't do it themselves.

    So, to take from a recent example, if you have questions about something and I ask you to send me specific data to help troubleshoot the issue, send me the data, specifically. Don't only send some of it.

    If I have specific questions requiring a specific type of answer, answer them specifically with the type of answer requested.

    If I ask you to name files in a certain way, do so. Don't send me a link to your Google Drive/DropBox account and tell me I can just rename them. That's not being generous, it's being inconsiderate and lazy.

    And never, but never ask me to do something for you like conduct searches on sites, download files for you, etc. especially while pretending you're somehow helping me. If we're at that point, I'm done with you.

    Oh and also, I don't care about your deadlines. If you're asking me to do things for you, I seriously doubt you care about your deadlines either.

    --------------

    Related Content:

    Written by nitin

    November 20th, 2012 at 3:13 pm

    Posted in opinion

    Tagged with , ,

    Netflix: thumbs down?

    leave a comment

    Before I crawled out of bed yesterday, I checked my email and saw a message from Netflix CEO Reed Hastings. The contents of the email are similar to this blog post by him.

    I didn't mind the recent price hike referenced in the post – it's easy for consumers to only look at things from their perspective. Offering unlimited streaming is a big freakin' deal. Add on that it works really, really well (thanks Silverlight!) and I really didn't mind it. Hell, I often eat up bandwidth leaving a movie on for streaming while I fall asleep.

    But the news that the DVD part of the business will be forked off to Qwikster is another matter. Sure, the name's stupid but more importantly the idea of not being able to simultaneously browse both the physical and streaming collection and to seamlessly manage my queues is really making me think about dropping Netflix or the impending Qwikster. But I probably won't due to two reasons:

    1. The enormous selection. I don't watch many new, popular movies and Netflix/Qwikster has a damn good selection of streaming and DVD titles for the kind of stuff I watch. I don't know if there are good alternatives out there – including Amazon, etc. – but I guess I'll now be investigating further.
    2. If Qwikster has a similarly cool API as does Netflix – er, at least in it's current form – then it's only a matter of days after Qwikster launches that someone will write an application that will let me manage both accounts from one interface.

    I guess what really bugs me is the hasty-ness of Hastings' communications. It's probably not a good idea to serve up a controversial chaser after a glib apology. And I can't help but think they're just trying to separate their DVD business so it can be sold off.

    As of this writing there are over 20k comments on the blog post, including this gem by one Bryan Thompson:

    This is great news! My dentist just did the same thing. It's so much better. Now when I have cavities on my top row of teeth I go to one dentist, and when I have cavities on the bottom row, I go to the other dentist across town. Sure it's frustrating that sometimes they can't access my dental records that used to all be in one place, and yes I admit that it seems strange that they now charge me almost twice as much for the exact same dental care I received six months ago … but they are innovators!

    Anyway, I'm going to wait and see on this one. If Qwikster doesn't have a cool API that allows for item #2 (above), then there might be a serious problem. In fact, whoever writes that app (and I'm sure there will be many variants) could potentially be saving Netflix's rear end – at least in the short term.

    When the day comes where there are no DVDs that can't be streamed – along with all special features and language/subtitle options – then Netflix's decision will make sense. But right now, I'm not sure it does.

    And yes, I probably don't have the rights to use "thumbs down". Sorry, Roger.

    Update, October 11, 2011: Well, it looks like Netflix won't be splitting into Netflix/Qwikster after all per their blog post here.

    --------------

    Related Content:

    Written by nitin

    September 20th, 2011 at 10:45 am

    Posted in opinion

    Tagged with , ,

    thinking about note entry in digital score editors

    leave a comment

    Good artists borrow, great artist steal.

    I've generally heard this quote attributed to Pablo Picasso but also to Igor Stravinsky. It's interesting to think that since they knew each other, one might have stolen it from the other. Related, I glossed over this post on About.com. I didn't read it carefully, but I agree that the quote is misunderstood. I've had discussions in the past with friends who simply invoked the quote to justify lifting parts of someone else's work in the pursuit of trying to create. But as I'd say to them, that isn't stealing, it's only borrowing. If you steal something, you make it yours, making it work so well in your image that it's no longer immediately recognizable as having belonged to another.

    Anyway, this post is about digital music notation. Or so says the title.

    I've long wondered – even going back to music school in the 1990's – about using a stylus to enter music onto a drawing tablet or what not. And I'm thinking about it more and more. I saw this thread about the possibility of MuseScore on Android tablets, etc. and using the touchscreen interface on a tablet computer as a desktop/mouse replacement, but I keep thinking about the stylus.

    Why? Because I hate entering music in score editors. I find it infinitely faster to write music with pencil and paper. I think a large part of this is that there's no translation occurring in my mind between hearing/visualizing what I need to enter and then mapping that into how that gets entered in a score editor. Arguably, some of that irritation would get reduced with practice, but not really in my case because I just don't think that graphical entry via keyboard or a note palette is the way to go.

    When I write with pencil, I usually jot down the notes for a small phrase, etc. and then add the stems in afterward. I think that way the part of my brain thinking about notating pitch and duration are slightly separated which for me, at least, makes the process faster. Sometimes I enter each pitch and its duration note-by-note, but it's not conscious and it's still fast. With digital, I find it a royal pain in the ass trying to enter pitches first and then changing their durations. Often, it seems to "break" the measure and cause encoding problems. Going the other route (entering the durations first) doesn't work either. They're both so much more work than the traditional pencil and paper routine. And I end of spending more time thinking about notation entry than Music.

    That's what I love about Lilypond. I love how I can just enter into a simple text string just the note names and then go back and add the durations – there's no digital/graphical veil. In terms of computing, it seems a more native and natural approach to use simple text entry than to try and emulate ideas from the pencil and paper world (but do it less efficiently). In fact, Lilypond is the only thing I could use to notate a score as I compose it. That's what I did with this piece for solo guitar. It would be great if MuseScore had something similar to a text-only entry box that rendered as graphical notation after-the-fact, kinda like what happens on weblily.net. I almost want to say Finale does (using some custom text notation syntax) but I can't seem to find a screenshot or anything in the online Finale manual.

    I just feel like maybe we're borrowing when we should be thinking about stealing. If we're going to pursue graphical ways of digital notation entry, I think we've got to think about being able to offer users the ability to draw their music with a stylus (more precise than a finger) and having that instantly recognized via OMR. That experience won't be too great on a tablet, but maybe one day with better digital paper and smart pens:

    What happens when you have to erase something?

    Maybe I'm just an old fogey, but for now I'll stick with pencil and paper and just using a computer to make the music more legible after the fact. There's one enormous advantage to this, too, and it's a musical one: it forces conscious consideration of revision upon me as I "re-notate" the music. Honestly, I doubt I would ever want to not use pencil and paper anyway. Music is the one thing I can do without needing a computer at all.

    --------------

    Related Content:

    Written by nitin

    September 11th, 2011 at 9:06 am

    on why I left Facebook

    leave a comment

    I recently got an email from a friend asking if I'd de-friended her on Facebook.

    Nevermind that I hate the word "de-friend" – spoken or written but especially when written without the hyphen … alas, she didn't use the hyphen.

    I hadn't removed her from anything at all. Indeed, what I removed was myself.

    I joined Facebook during library school because a lot of my cohort members were on it and also a lot of my other friends were increasingly less likely to respond to an email of mine, though I knew they actively conversed via text/SMS and social networks.

    I've never been big on texting, especially in the days where I had a really basic phone without a keyboard. Even with T9, I hated it. In fact, even on my Android I still kinda hate it. I just find texting horribly inefficient. I only use it when I know my friends don't have consistent online access.

    So anyway, social networking seemed like the only choice. Initially it was cool, people actually responding to questions of mine(!), getting "caught up" with old friends and all. But over time, it seemed nothing was happening, as if we were all in a temporal stasis.

    And then there's just Facebook itself. I increasingly found it to be obnoxious both in its functionality and its policies – and also how it reminded me of how trivial so much of the e-communication I engage in is. In fact, when I deleted my Facebook account permanently a few weeks ago, I also deleted most of my phone and email contacts.

    For sure, I'll miss some people who I may never talk to again. But missing people is part of the human experience for which some aspects of social networking are poor, poor substitutes.

    And NO, I won't start using Twitter now.

    ps: here's a tremendously interesting post entitled "Federation! (Goodbye, Google Plus)" on social networks, corporations, open-networks, etc.

    --------------

    Related Content:

    Written by nitin

    August 27th, 2011 at 11:43 am

    so long Firefox, I'm moving on

    2 comments

    I've been using Firefox pretty much from its beginning.

    I started using it when I worked at a public library in Charleston, South Carolina. One of our tech guys was really young and into all the new gadgets and technologies. He introduced me to Firefox … and I believe he even gave me my first Gmail invite.

    Around that time I also played around with the actual Mozilla Browser and K-Meleon, which ran so well on my old Pentium 3 that I was able to use that computer a year or two longer than had I used another, slower browser.

    Initially, FF was so much different – and better – than Internet Explorer that I looked past the relentless crashes. Everything crashed back then, so I couldn't hold it against FF.

    But now I can. For the last couple of years, I've become increasingly tired of FF's performance – or lack thereof. Crashes, lockups, slow start-ups, etc, etc.

    So, I'm moving on.

    I'm going with Chrome even though I think the new IE is great. I don't want to use Safari – 'cause I'd rather not directly support Apple (perhaps more on that one day).

    … and then, as far as mainstream browsers, there's Opera. It's always been a great, cutting edge browser. But there's always been one little reason I didn't use it.

    If I remember, it initially wasn't completely free. If you didn't pay, you had to see ads. I couldn't deal with that, so I moved on.

    Subsequent versions of Opera just didn't look and feel like a native Windows application. That didn't work for me either.

    But Opera totally rocks now. So why am I not using it? Because there's currently no Xmarks for Opera. I love Xmarks and if an Opera version comes along, I'll probably move to Opera. Finally.

    Back to FF.

    I think a lot of us tend to give some applications a free-pass if the program is, er, free. And if it positions itself as the noble, open-source alternative to the big, bad corporate machine … even better. I certainly can be accused of using FF and some other software more for these idiotic, faux ethical reasons than based on actual performance and productivity metrics. But I just don't have a compelling reason to use FF anymore. And I won't.

    ps: here's an interesting post entitled "How Mozilla lost the browser wars again".

    Update, August 13, 2011: After experiencing some issues where Chrome seemed unable to maintain a steady connection with my DSL service and after realizing (after I wrote the post) that I clearly wanted to use Opera, I did make the switch to Opera. While there's no Xmarks for Opera, there is Opera Link. Since I'm not really syncing bookmarks across different types of browsers and just wanted a nice bookmarks backup, Opera Link should be fine. BTW, I did have to refer to this video to get Netflix streaming working on Opera.

      Written by nitin

      August 7th, 2011 at 12:02 pm

      Posted in opinion,technophilia

      Tagged with , ,

      spherical beer bottles: on standards and standardization in the digital realm

      leave a comment

      A quick Sunday morning rant …

      I imagine it would be pretty hard to do business if I owned a small brewery but insisted on selling my beer in spherical beer bottles year round.

      That would probably create all kinds of problems in terms of packaging, delivery, and shelf space in a retail setting. Sure, I could do it if I insisted but I imagine the financial consequences for my small brewery would be too great versus just complying with the bottling norm.

      Having said that, I sometimes I get a little sick and tired of talks of "standards" in Library Land with respect to digital information. It's as if some people still have a physical-world mental barrier clouding their thinking.

      My take is that, standards aside, it's pointless to re-invent the wheel in any walk of life so, yeah, give standards a chance but if the wheel doesn't do what you want then one shouldn't hesitate to "roll their own". Moreover, I'd argue that a great many "standards" are in fact born out of that very kind of dissatisfaction with the status-quo. Reminds me of this great quote by George Bernard Shaw:

      The reasonable man adapts himself to the world; the unreasonable one persists in trying to adapt the world to himself. Therefore all progress depends on the unreasonable man.

      And isn't that the very freedom that digital information provides? Isn't that one of the points of the web – that everyone can contribute with very minimal pre-requisites?

      But it seems like some people think metadata formats and ways of doing things always have to be determined by committees.

      I just don't get that especially since the device anyone is reading this on exists in large part due to private entrepreneurship and not a bunch of people sitting around a table talking about how great the world would be if we all did things their way. Actually, that's exactly what happened, but it wasn't just a bunch of academics without the resources to transform theory into practice.

      The difference in rolling one's own is, of course, that you have to make your data comply to existing standards or practices if you want to reap the benefits of doings things the same way as others like linked data and what not.

      Big deal. If the data exists at a granular level it can be "shipped" in spherical AND regular bottles simultaneously.

      And if you're a big enough player you can make people go along with your vision and indeed ship a product in round bottles after all …

      picture of round Christmas Coke bottles

      Too bad those aren't filled with a nice IPA.

      :)

      Update, March 2012: I've often meant to update this post because I never differentiated a "standard" vs. something that's "standardized". The former emerges based on adoption and proven value over time, the latter is that which is simply documented. And I think the problem is that many people – perhaps librarians aren't any more to blame than others – confuse the latter for the former and therefore insist on usage of the latter.

      --------------

      Related Content:

      Written by nitin

      July 24th, 2011 at 11:58 am

      Posted in opinion

      Tagged with , ,

      on why search and cloud tags will ruin your dinner parties

      2 comments

      Just shooting from the hip here …

      I'm imagining a library in which I couldn't browse the collection physically by walking up and down the aisles.

      Where all I can do is approach a reference librarian and have them bring me back items they thought matched my needs based on a short "interview".

      Where I'd then assess what's before me, clarify a few things, and have them bring me back more things – only to learn that a few items I sent back are ones I now want back.

      What a friggin' mess. It's like shopping for a suit at a men's clothing store. I don't even want to think about having to do this to buy groceries. Oh, the horror …

      But isn't that what search is?

      And yet, the Kool-Aid tells us search is better than browsing.

      Bull poop.

      Implementing search and even keyword tags on a website (like this blog!) is easy to implement programmatically – a matter of some simple SQL, assuming SQL is the backend for the site's metadata. Maybe that's the real reason they're so prevalent.

      Employing some kind of extensible taxonomy to categorize the information takes more work … and more thought. For sites with a lot of content that will be used over and over for research and referential purposes, it's better if the user has both search and true browsing opportunities, says I.

      --------------

      Related Content:

      Written by nitin

      July 4th, 2011 at 10:11 am

      how not to crowdsource with Facebook

      leave a comment

      A few days ago I was included in a group email on Facebook concerning someone's desire to get some ideas regarding an upcoming interview for a librarianship position. That's fine, I don't mind. But I had no interest nor anything to add. So after a few replies kept coming in to my phone I replied to the group suggesting they move the conversation to an actual FB group or to the originator's wall.

      The response I got was, well, pretty telling.

      Basically, I was told that since the group discussion was going there wasn't a way to move the discussion to a group or a wall post and that I could remove myself from the thread with a vague reference to "message options" – options I didn't have. So I requested instructions (i.e. more information) specifically for mobile and was given a link which of course referenced options unavailable to me. So I just marked the thread as Spam and maybe that's why I haven't seen any more messages in the thread. Also, I told the person they simply had to copy/paste the replies and paste them into a FB "Note" on their wall. That way all the replies wold be preserved and people could continue to reply and be notified of updates if they participated. In other words, the claim that moving the conversation would be impossible or at least troublesome was incorrect.

      One should never force people into a conversation. Of course, sometimes we don't know if people do or don't want to participate. But we should always be prepared for the chance that some people won't want to participate and we should be prepared to make it possible for them to leave with minimal effort. Not doing so seems un-librarian like to me. Nor does it seem to convey an understanding of how to use some of the social media tools out there and the various methods and devices others might use to access social networking sites.

      Invite people in, don't show up at their door and expect them to give their time.

      The latter's not exactly a way of doing things I'd want in a new hire.

      --------------

      Related Content:

      Written by nitin

      April 2nd, 2011 at 10:00 am

      Posted in opinion

      Tagged with ,

      Switch to our mobile site