Free AI web copilot to create summaries, insights and extended knowledge, download it at here

Abstract

d over the phone like I was?</p><p id="da3a" type="7">How about this: Try to picture Ivanka, the object of her creepy father’s even creepier lust, punching a time clock! That will happen around the same time I start flying jets.</p><p id="709d">Maybe I’m being overly sensitive. I mean, if I could make my own sea salt while basking under the Mediterranean sun, wouldn’t I bleat about it later? Shit, no. I would not.</p><p id="1c29">So you know, the column’s focus was on scent and how it evokes particular memories. Here is the passage that set me off:</p><p id="d741"><i>When I was in Spain this summer, we sun-dried our own sea salt in Majorca, then went to a little shop near where we ate dinner to buy flor de sal harvested from the same Ses Salines salt flats. When I popped open the can — later back at home, my kids shouted, “it smells like Majorca!”</i></p><p id="c3f4">“Gee, kids! How cool is that? Know what? Get outta here”</p><p id="d35c">For those of us who don’t vacation in Majora, <i>flor de sal</i> means Salt Flower. Now, is it me, or is this type of self-important strutting gag-worthy?</p><p id="0c73">I’m not so offended by the message as much as I am by the way it was conveyed. As if the messenger had no clue of the disparity around her and the reality that people are struggling to make ends meet, for God’s sake. Struggling to feed themselves and their families. Working for minimum wage.</p><p id="051d">I get that this magazine is about beauty, not our country’s economy but all I can say is, the salaries must be pretty damned good.</p><p id="22b4">We, as writers, understand that words are powerful and the <i>way</i> in which we say things is as important, or maybe more so, as <i>what</i> we’re putting out into the world. I’ve learned this particular lesson the hard way. More than once.</p><p id="d5bd">Admittedly, I’m particularly sensitive in that I haven’t received an actual paycheck in almost two years. And I’m better than that. Much better, yet I can’t seem to catch a break. So, where someone else might read the editorial and think of it as “aspirational,” I think, “WTF?” Just as I do when I see TV commercials touting luxury automobiles as holiday gifts. What world are we living in?</p><p id="8d58">This is what doesn’t compute: While the editor raves about her kids raving about Majorca, there are other, less privileged children starving in this country. Their parents would love to afford a bus ticket, let alone a first-class airline ticket to Spain.</p><p id="f2ee">A little empathy for others, folks. That’s all I’m asking.</p><p id="184a">According to <i>nokidhungry.org</i>, in the United States, one in seven children lives with hungry. The bigger picture: According to the U.S. Department of Agriculture (USDA), more than eleven hundred children in our country live in “food insecure homes,” which means the family members don’t get enough to eat in order to live in a manner that’s deemed “healthy.”</p><p id="7845">Maybe the editor should set her cannister of DIY sea salt aside and chew on these stats:</p><p id="1300"><b>Over 4.5 million U.S. kids live in food deserts and lack access to grocery stores with fresh fruits and vegetables.</b></p><p id="742e"><b>On average, children in rural areas are more likely to experience food insecurity and lack access to quality health services.</b></p><p id="7f6a"><b>Close to 1 in 3 American children are overweight or obese, and obesity in children has more than tripled over the past 35 years, putting children at higher risk for serious, even life-threatening health problems.</b></p><p id="a02e"><b>In communities where Save the Children works, an average of 59 percent of children do not have access to fresh, healthy foods; in some areas, it’s as much as 98 percent.</b></p><p id="bc2d">Here’s more self-satisfied bunk from the editorial:</p><p id="c1b6"><i>In (country), last summer, my daughter and I treated ourselves one afternoon to tea at the (uber-luxe) hotel. Now, the scent of not only jasmine tea but also jasmine fragrances brings me half a world away to that fancy dining room, nibbling on tiny sandwiches

Options

and cakes.</i></p><p id="0408">Again, maybe I’m being unfair and bristly. But the manner in which this was written is offensive, in my humble opinion. Plus, the older I get, the less idiocy I can tolerate.</p><p id="712b">Maybe if she’d included some type of giveaway to the first fifty readers who wrote back via email, describing their favorite scents and what they evoked for them. Jasmine fragrance oil could be the giveaway. I don’t know.</p><p id="7d81">Perhaps this editor should stick to writing about lip conditioners and designer perfumes and the wonders of glycolic acid. Meanwhile, if the craving for a “tiny cake” should come upon her, she could always shove a Twinkie up her bum.</p><p id="444c">I’d like to thank <a href="undefined">Helen Cassidy Page</a> for her input here. She gave me the virtual slap upside the head that I needed. But, sweetly.</p><p id="6d7e"><i>Sherry McGuinn is a slightly-twisted, longtime Chicago-area writer and award-winning screenwriter. Her work has appeared in The Chicago Tribune, Chicago Sun-Times and numerous other publications. Sherry’s manager is currently pitching her newest screenplay, a drama with dark, comedic overtones and inspired by a true story.</i></p><p id="2284">As always, I appreciate your reading. If you’re up for more:</p><div id="974d" class="link-block"> <a href="https://readmedium.com/haiku-how-to-51d0685c1ad6"> <div> <div> <h2>Haiku How-To</h2> <div><h3>A primer for the sexually inquisitive.</h3></div> <div><p>medium.com</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/1*yQwyx3SGkE3-oZlWW1dC9g.jpeg)"></div> </div> </div> </a> </div><div id="654f" class="link-block"> <a href="https://readmedium.com/did-i-fail-my-mother-3323d4907780"> <div> <div> <h2>Did I Fail My Mother?</h2> <div><h3>All the things I should have said, and didn’t.</h3></div> <div><p>medium.com</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/1*IBboE8lKu9O0Q4Ga0aEGhQ.jpeg)"></div> </div> </div> </a> </div><div id="9067" class="link-block"> <a href="https://readmedium.com/the-hot-women-of-medium-c66515ba6bbe"> <div> <div> <h2>The Hot Women of Medium</h2> <div><h3>Smart, funny, gutsy and SMOKIN’!</h3></div> <div><p>medium.com</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/1*sUDy3LYDjjZKQqXsMfyptQ.jpeg)"></div> </div> </div> </a> </div><div id="1a63" class="link-block"> <a href="https://readmedium.com/ive-never-received-1k-claps-b1dd0d9c56b9"> <div> <div> <h2>I’ve Never Received 1K Claps</h2> <div><h3>Wounded…and wondering.</h3></div> <div><p>medium.com</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/1*zAfXUminR_ELCNKW8Ppsgw.jpeg)"></div> </div> </div> </a> </div><div id="11fc" class="link-block"> <a href="https://readmedium.com/its-official-i-m-an-a-hole-347624d73cd7"> <div> <div> <h2>It’s Official: I’m an A-Hole</h2> <div><h3>“Medium Madness” has me by the throat.</h3></div> <div><p>medium.com</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/1*r4v7h4lCPyj7liblwp-GNQ.jpeg)"></div> </div> </div> </a> </div></article></body>

Exploration, Exploitation and Imperfect Representation in Deep Learning

Credit: https://unsplash.com/photos/uSPjZzYwXO4

The algorithms of learning can be coarsely abstracted as being a balance of exploration and exploitation. A balanced strategy is followed in the pursuit of a fitter representation. This representation can either be one that improves a model that is being learned or can be at the meta-level where it improves the algorithm that learns better models.

In exploitation, an automation greedily pursues a path of learning that provides immediate rewards. In exploration however, an automation must decide to forego an immediate reward and select instead a directionless exploration with the intent of discovering a greater reward elsewhere. The strategy to select one over the other is sometimes referred to as “regret minimization”. As a side, Jeff Bezos has a very human interpretation of this strategy:

“Okay, now I’m looking back on my life. I want to have minimized the number of regrets I have.” I knew that when I was 80 I was not going to regret having tried this. I was not going to regret trying to participate in this thing called the Internet that I thought was going to be a really big deal. I knew that if I failed I wouldn’t regret that, but I knew the one thing I might regret is not ever having tried. I knew that that would haunt me every day, and so, when I thought about it that way it was an incredibly easy decision.”

It is also related to the idea of Counterfactual Regret Minimization (CFR). This method is used by Libratus a poker playing machine that has bested professional players. CFR is applicable in domains with imperfect-information. In short, the strategy of selecting exploration over exploitation is relevant to domains with imperfect information.

Stochastic Gradient Descent (SGD), the workhorse learning algorithm of Deep Learning, are algorithms that employ exploitation as its fundamental motivation. SGD works only for networks that are composed of differentiable layers. Convergence happens because there will be regimes in the parameter space that guarantee convergence of iterative affine transformations. This is well known in other fields such as Control Theory (known as Method of Adjoints) as well as in Chaos theory (Iterated Function Systems).

However, exploration features are shoe-horned into classic gradient descent through different kinds of randomness. Examples of these are, the randomness in how training examples are presented, noise terms in the gradient, dropout and batch normalization. When we examine the two phases of gradient decent, we realize that the first phase is dominated by exploitation behavior. This is where we see a high signal to noise ratio, and the convergence is rapid. In this phase, second order methods that exploit the Natural Gradient (see: Fisher Information Matrix) will converge much faster. A recent method known as the Kronecker Factorization (K-FAC) that approximates the FIM has shown to exhibit 20–30 times less iterations than traditional first order methods.

In the compressive phase, exploration will dominate and randomization methods facilitate these explorations. In this regime, the gradients carry negligible information and thus the convergence is extremely slow. This is where representation compression occurs. The elusive goal of Generalization is achieved through the compression of representation. We can explore many interpretations as to what Generalization actually means, but ultimately, it boils down to the shortest expression that can accurately capture the behavior of an observed environment.

Evolutionary algorithms (aka Genetic algorithms) occupy the space of exploration approaches. In Deep Learning, evolution algorithms are usually been employed in searching for architectures. It is a more sophisticated version of hyper-parameter optimization in that instead of juggling constants like learning rates, the search algorithm juggles the composition of each layer of a network. It is used as the outer loop of the learning algorithm. The thing though about evolutionary algorithms is that serendipitous discovery is fundamental. In short, it works only when you are lucky.

Either method or a combination of both can lead to a fitter Representation. Let’s deconstruct the idea of Representations. In a previous post, I discuss 3 different dimensions of intelligences that are being developed ( computational, adaptive and social). The claim is that these are different kinds of intelligences. What is apparently obvious is that the domains in which they operate are different from each other. So form will have to follow function. The methods and architectures that are developed for each kind of intelligence are going to be different from each other. One dimension of Representation is obviously the domain in which it is applicable.

There is of course a question whether we should explore different kinds of Representations. I mean this at a more general level. In Deep Learning, there are all kinds of different neural embeddings. These embeddings are vector representations of semantics. These vector spaces are learned over the course of training. These vectors are supposed to represent an invariant form of the actual concept. One major difficulty of Deep Learning systems is that these representations are extremely entangled. The fact that they are entangled can explain why Deep Learning systems have zero conceptual understanding of what they predict. Understanding requires the creation of concepts, if concepts cannot be factored out, then what does that imply for understanding? It is important to realize, that there are many cases where understanding is not needed for competence.

I will argue that AlphaGo doesn’t understand Go in the same way as humans. Humans understand Go by creating their own concepts behind the strategies they employ. AlphaGo doesn’t have an understanding of these concepts. Rather, it has memory and the statistics of billions of moves and their consequences.

The concepts that exist as part of a Representation may exists in 3 forms. A generalizations, a prototype or an exemplar. Deep Learning focuses on creating generalizations through the capture of an invariant representation. This is why, data augmentation is a best-practice approach. So when working with images, images are rotated, cropped, de-saturated etc.. This trains the network to ignore these variations. In addition, Convolution Networks are designed to ignore image translations (i.e. difference in locations). The reason DL systems require many training sets is that it needs to “see” enough variations so that it can learn what to ignore and what to continue to keep relevant. Perhaps however that the requirement for invariances is too high and we should seek something less demanding in the form of equivariances.

In the realm of few-shot or zero-shot learning, where an automation must learn something by seeing it only once or a few times, then there is zero opportunity to discover invariances. An automation only has a few examples to create a prototype, that is representative of the entire class. So there needs to be some prior model that is capable of performing the appropriate similarity calculation. The system must know how to determine if an example is similar to a prototype.

Even worse, if its just one example, then there isn’t really a class and the system has to deduce a generalization. The implication of the latter is that, an automation requires that an internal model existing prior to any deduction.So we have here three kinds of models. A model-free representation (learned through induction), a representation for a similarity algorithm and a rich representation that can drive deduction (or abduction).

It is interesting that the human mind is able to recall unusual configurations, yet be unable to recall finer details. This is an example of attempts of people to draw the Apple logo:

Source: https://www.signs.com/branded-in-memory/

Meanwhile DL networks, that have zero understanding of an image, are much better at recreating images than average humans:

Source: http://www.evolvingai.org/files/nguyen2016synthesizing.pdf Not the Apple logo, but very accurate renditions of apples.

Not many people have the ability to visualize in their head an image and recreate it properly via a drawing. I’m unsure if this is a weakness in their articulation skills or that the details of an object aren’t actually captured by the brain. In fact, human’s are typically blind in many contexts:

It is easy to not detect the change in the two photos above http://nivea.psycho.univ-paris5.fr/

Another illustration of ‘inattentional blindness”:

The human mind simply does not have the same kind of photographic memory that a machine has. Its capacity is simply limited and requires throwing out a lot of information away so as to cope with information overload.

In a previous post, we explored how model-free and model-based cognition can be interleaved in the process of learning. Exploration and exploitation can also be interleaved in learning. However, both sets are orthogonal. As in SGD, you can have a model-free algorithm that uses both exploration and exploitation. You can also have model-based algorithms that explore or exploit. That is, there are at least three dimensions that are described here. One dimension is on the axis of exploration to exploitation. The second dimension is if the learning process is driven by a explicit model or not. The third dimension is the nature of the learned representation itself.

Bridging the semantic gap is still an extreme challenge.

**Explore Deep Learning**: Artificial Intuition: The Unexpected Deep Learning Revolution

**Exploit Deep Learning**: The Deep Learning AI Playbook